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INTRODUCTION 


This  "Semi-Annual"  Technical  Report  covers  research 
carried  out  by  the  Advanced  Teleprocessing  Systems  group 
at  UCLA  under  ARPA  Contract  MDA903- 77-C-0272 ,  covering  the 
period  August  1,  1977,  through  June  30,  1978.  The  research 
herein  conducted  is  basically  a/ continuation  of  our  former 
ARPA  Contract  Number  DAHC-15-73-C-0368 ,  which  was  also 
devoted  to  Advanced  Teleprocessing  Systems.  Under  this  new 
contract,  we  have  two  designated  tasks  as  follows: 


Task  1:  Radio  Packet  Switching  Systems 


Advanced  studies  regarding ^he  fundamental  analytic 
and  design  considerations  for  random  multiple^^.. 
access  radio  packet  switching  systems.  We  will 
investigate  the  basic  performance  measures 
including  capacity,  stability,  control,  routing, 
and  the  tradeoffs  among  these  quantities  for 
ground  and  satellite  packet  radio  systems,  &  >■  c,  ( 


Advanced  Research  in  Distributed  Communications 


Advanced  studies  ift).  internetting ,  flow  control, 
distributed  access,  fundamental  capacity  definitions 
and  contours,  and  investigation  of  the  underlying 
cost-performance  behavior.  '>  (<\J) 

We  have  made  significant  progress  in  the  two  named  tasks 
In  the  following  paragraphs,  we  describe  the  progress  and 
give  pointers  to  those  references  which  represent  the 
published  work  resulting  from  this  supported  research. 

Following  this  short  summary  is  a  list  of  publications 
produced  as  a  result  of  the  recent  research  on  this  contract 
covering  the  period  being  reported.  This  list  contains  only 
those  articles  and  reports  which,  in  fact,  did  appear  in 
print  during  this  period.  Papers  which  have  been  submitted 
(of  which  there  are  many)  are  not  listed  here,  but  will  be 
listed  in  future  reports  as  they  appear  in  the  published 
literature.  We  devote  the  main  body  of  this  report  to  the 
detailed  presentation  of  one  aspect  of  this  overall  research, 
and  we  simply  mention  the  other  areas  briefly  in  this  summary 


1  Or-  ^  7 

6  \  » cm  s 

The  research  reported  in  the  main  body  of  this  document 
discusses>various  switching  and  flow  control  techniques  as 
applied  to  packet  communications  networks, — Described  r-herein>  r  'A 
i si  a  new  switching  technique  called  ^virtual  cut-through*.  A 

and  its  performance  is  analyzed  and  graphed.  Moreover,  a  _ J 

^comparison  is  made  between  this  new  switching  scheme  and 
message  (or  packet)  switching  and  circuit  switching.  Further¬ 
more,  ^  window- type  flow  control  procedure  is  defined  and 
evaluated.  We  study  both  static  and  dynamic  flow  control 
procedures  and  describe  an  optimal  control  policy  to  dynamically 
select  window  size.  A  network  algebra  for  interconnected 
networks,  both  in  series  and  in  parallel,  is  developed  and 
evaluated. :* Further,  buffer  allocation  schemes  in  which  a 
number  of  channels  converge  at  a  single  node  which  must  share 

a  finite  buffer  in  that  node  are  analyzed  and  evaluated.  - 

The  particular  details  in  the  following  report  represent  the 
Ph.D.  dissertation  of  Parviz  Kermani  (Chairman,  Leonard 
Kleinrock)  and  the  work  is  entitled,  "Switching  and  Flow 
Control  Techniques  in  Computer  Communication  Networks." 

Turning  now  to  the  list  of  references,  we  comment  that 
reference  #1  gives  a  global  overview  of  distributed  multi¬ 
access  computer  communication  systems,  placing  the  various 
access  schemes  in  a  commun  framework  and  identifying  the 
cost  which  nature  will  extract  for  distributed  communications. 

New  measures  of  performance  and  a  reduced  set  of  parameters 
are  identified  in  this  work  which  have  turned  out  to  be 
extremely  useful  in  evaluating  new  and/or  old  access  schemes. 
Reference  #2  represents  a  significant  piece  of  work  in  the 
field  of  stream  packet  communications  (e.g.,  packetized 
voice).  Reference  #3  gives  a  very  effective  control  method 
for  maintaining  slotted  ALOHA  as  an  access  scheme  in  a 
stable  mode;  simulation  experiments  are  extremely  encouraging 
regarding  this  very  simple  control  method.  Reference  #4 
examines  stability  and  dynamic  control  in  ground  radio  packet 
switching  and  gives  a  mathematical  evaluation  of  the  various 
stable  and  unstable  modes  and  their  basic  performance. 

Reference  #5  presents  some  measurement  results  of  the  ARPA 
packet  satellite  experiment  and  shows  the  way  in  which  various 
access  schemes  compare  to  each  other,  both  from  a  theoretical, 
simulation  and  experimentally  measured  point-of-view. 

Reference  #6  presents  a  study  of  ground  radio  access  to  a 
station  through  a  multi-hop  tree-type  topology;  we  find  that 
very  few  buffers  suffice  at  each  node  due  to  the  half-duplex 
nature  of  the  transceivers  (repeaters).  Reference  #7  is  the 
main  body  of  the  support.  Reference  #8  refers  to  a  significant 
work  in  the  evaluation  of  hierarchically  structured  wire 
and  broadcast  networks;  here  one  observes  the  gains  to  be 
had  through  hierarchical  organization  and  the  various  modes 
in  which  one  can  interconnect  communicating  terminals  to 
increase  their  capacity  at  fixed-delay  or  to  minimize  delay 
at  a  fixed  network  investment.  Reference  #9  describes  a  new 


ii 


optimal  access  scheme  for  ground  radio  packet  switching 
which  promises  to  be  extremely  useful  for  multi-hop  ground 
radio  networks.  Reference  #10  gives  an  overview  of  global 
flow  control  in  a  network  environment.  Here,  a  new  definition 
of  the  cost  of  a  flow  control  scheme  as  well  as  some  funda¬ 
mental  optimization  methods  is  presented.  Reference  #11 
examines  the  effect  of  various  acknowledgment  mechanisms  in 
packet  switched  radio  channels  so  that  one  can  sensibly 
select  from  among  the  various  schemes  available  today.  The 
research  reported  in  these  published  articles  is  continuing 
and  we  are  currently  investigating  new  areas  as  well. 

The  main  report  on  switching  and  flow  control  in  packet 
networks  is  given  following  this  list  of  publications. 


LIST  OF  PUBLICATIONS 


1.  Kleinrock,  L.,  "Performance  of  Distributed  Multi-Access 
Computer -Communication  Systems,  Information  Processing  77, 
Proceedings  of  IFIP  Congress,  Toronto,  Canada,  August  1977, 

North  Holland,  Amsterdam,  1977,  pp.  547-552. 

2.  Naylor,  W.E.,  "Stream  Traffic  Communication  in  Packet 
Switched  Networks,"  Ph.D.  Dissertation,  School  of  Engineering 
and  Applied  Science,  Computer  Science  Department,  University 
of  California,  Los  Angeles,  September  1977. 

3.  Gerla,  M.  and  L.  Kleinrock,  "Closed  Loop  Stability  Controls 
for  S-ALOHA  Satellite  Communications,"  Proceedings  of  the 
Fifth  Data  Communications  Symposium.  Snowbird.  Utah. 

September  1977,"  pp.  2-10  to  2-19. — 

4.  Tobagi,  F.  and  L.  Kleinrock,  "Packet  Switching  in  Radio 
Channels:  Part  IV  --  Stability  Considerations  and  Dynamic 
Control  in  Carrier  Sense  Multiple  Access,"  IEEE  Transactions 
on  Communications,  Vol.  COM-25,  October  1977,  pp.  1103-1119. 

5.  Gerla,  M.,  L.  Nelson  and  L.  Kleinrock,  "Packet  Satellite 
Multiple  Access:  Models  and  Measurements,  Proceedings  of  the 
National  Telecommunications  Conference.  Santa  Monica” 

California ,  December  1977,  pp.  12.2-1  to  12.2-S. 

6.  Tobagi,  F.,  "Performance  Analysis  of  Packet  Radio  Communication 
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ABSTRACT 


The  objective  of  this  research  is  to  develop  analytic  models 
for  advanced  performance  evaluation  of  computer-based  communication 
networks.  The  emphasis  is  on  flow  control  mechanisms  and  on  models 
for  switching  techniques. 

A  new  switching  technique,  virtual  cut -through,  is  proposed  and 
its  performance  is  analyzed.  Through  the  comparative  study  of  different 
switching  schemes  (virtual  cut -through,  message  (or  packet)  switching, 
and  circuit  switching),  we  establish  certain  ground  rules  which  aid  the 
network  designer  in  deciding  which  switching  method  to  use  in  a  data 
communication  network. 

Flow  control  procedures  for  computer  networks  are  investigated 
and  analytic  models  for  the  "window"  mechanism  are  developed.  We  study 
both  "static"  and  "dynamic"  flow  control.  In  static  flow  control,  param¬ 
eters  of  the  system  are  not  adjusted  to  the  stochastic  load  fluctuations. 
We  study  the  optimal  window  size  for  static  flow  control  which  results  in 
maximum  throughput.  In  dynamic  flow  control,  parameters  of  the  system 
are  dynamically  adjusted  to  the  resource  availability  of  the  network. 
Based  on  Markov  decision  theory,  an  optimal  policy  to  dynamically  decide 
on  the  window  size  is  formulated.  Because  an  exact  solution  to  the  prob¬ 
lem  is  laborious,  an  effective  heuristic  solution  to  the  problem  is 
presented. 

Some  problems  in  interconnection  of  networks  are  also  studied. 
Based  on  our  study  of  static  flow  control,  a  network  algebra  for  series 


and/or  parallel-connected  networks  is  developed.  The  problem  of  resource 
allocation  in  parallel -connected  networks  is  investigated.  Finally,  a 
number  of  buffer  allocation  schemes  in  nodes  where  a  number  of  networks 
merge  together  is  analyzed.  These  allocation  schemes  serve  as  a  further 
tool  for  the  control  of  traffic  in  networks. 
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CHAPTER  1 


INTRODUCTION 


In  the  recent  past,  we  have  witnessed  an  explosion  in  the 


analysis,  design  and  implementation  of  computer  communication  networks. 
These  efforts  reflect  the  desire  to  exploit  and  share  the  computational 


capacity  of  geographically  separated  computer  systems,  which  results  in 
a  considerable  economy  of  scale  [ROBE  70] . 


The  demand  for  remote  processing  is  growing  at  an  enormous  rate 


[KLEI  76B] .  For  example,  it  is  projected  that  within  less  than  five 


years  in  Europe  there  will  be  approximately  five  million  terminals  in 


use  and  the  data  traffic  will  grow  up  to  10  bits/day  [PETE  73] 


Computer  networks  have  been  in  existence  since  the  mid  1950's 


One  of  the  earliest  computer  networks  was  the  SAGE  defense  network  in 


the  1950's  [EVER  57].  The  American  Airlines  SABRE  reservation  system 


came  shortly  thereafter  [EVAN  67].  Early  computer  networks  were  either 
designed  for  special  purposes  or  owned  by  private  corporations.  It  was 


only  after  the  conception  of  the  ARPANET  [CARR  70],  [FRAN  70],  [HEAR  70], 
[KLEI  70],  [ROBE  70],  that  the  surge  in  interest  toward  computer  communi 


cations  and  remote  data  processing  began. 

Two  general  categories  of  networks  can  be  distinguished:  the 


remote-access  networks  and  the  value-added  networks  [KIMB  75].  Remote- 
access  networks  are  designed  to  provide  access  between  a  user  and  a  given 


remote  computer,  either  for  terminal  ;ess  or  for  remote  job  entry 


Examples  of  this  type  of  network  are  the  GE  information  service  MARK  III 


_ _ t  - . 


(MALIC  74],  [SCHW  77],  and  TYMNET  (TYME  71],  [HARC  75].  (This  latter 
network  also  falls  into  the  second  category;  see  below.)  Value-added 
networks  support  communications  directly  between  computers  to  provide 
more  computer  power  and  to  distribute  the  computational  load  more  effi¬ 
ciently;  early  examples  of  experimental  networks  are  the  ARPANET  [refer¬ 
ences  cited  above],  the  British  NPL  Network  [DAVI  73]  and  the  French 
CYCLADES  [POUZ  75].  Examples  of  more  recent  networks  include  Telenet* 
Datapac  and  Tranpac.  By  providing  a  suitable  interface  between  host  com¬ 
puters  and  user's  terminals,  value-added  networks  can  easily  support 
remote-access  capability.  In  this  dissertation  we  are  mainly  concerned 
with  the  second  category  and  refer  to  value-added  networks  simply  as 
computer  networks. 

Figure  1.1  shows  a  simplified  computer  network.  It  consists  of 
a  number  of  disjoint  "local"  networks  and  a  "communication  sub-network." 
A  local  network  consists  of  one  or  more  computers  (HOSTS),  each  one  con¬ 
nected  to  the  communication  sub-network  through  one  node.  Users'  ter¬ 
minals  are  ususally  connected  to  HOSTs  and  are  part  of  the  local  network. 
(Users'  terminals  may  also  be  directly  connected  to  the  communication 
sub-net  via  a  TIP.) 

The  communication  sub-net  is  a  network  of  facilities  (hardware 
and  software)  to  deliver  messages  from  one  HOST  to  another  (or  from  one 
user  to  a  HOST) .  The  communication  sub-net  consists  of  a  number  of  com- 
muncation  processors  or  switching  centers,  connected  to  each  other 
through  communication  media.  Henceforth  we  will  refer  to  a  switching 
center  simply  as  a  node,  and  to  the  communication  sub-network  as  a 
communication  net. 
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Figure  1.1.  A  Computer-Communication  Network 


Two  alternative  media  can  be  used  to  connect  the  nodes  to  each 
other  and  to  host  computers:  broadcast  and  point-to-point.  Broadcast 
media  permit  simultaneous  transmission  (or  reception)  to  (or  from)  a 
number  of  nodes  [KIMB  75].  Two  examples  are  satellite  communication 
[ABRA  73],  [LAM  74],  [LAM  75C],  and  packet-switched  radio  [ABRA  70], 
[KAHN  75].  The  other  type  of  communication  media,  point-to-point,  per¬ 
mits  communication  only  between  two  nodes.  Examples  are  copper  wire; 
coaxial  cable;  microwave,  radio  and  satellite  channels  (note  that  these 
last  media  can  be  used  either  as  broadcast  media  or  as  point-to-point 
media) .  In  this  research  we  are  only  concerned  with  point-to-point 
media  between  two  points  and  refer  to  them  as  channels,  links,  or  lines. 

The  technique  incorporated  in  a  communication  net  to  deliver 
messages  from  one  HOST  to  another  is  called  the  switching  technique. 


•  Packet  switching 

(Both  message  and  packet  switching  are  members  of  a  larger  class  known 
as  store-and-foTward  switching.)  These  schemes  are  distinguished  by 
the  manner  in  which  resources  are  allocated  in  support  of  communication. 


Circuit  Switching 


With  circuit  switching,  a  complete  path  of  communication  must 


be  set  up  between  two  parties  before  the  communication  begins.  Path 
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set-up  time  plays  a  critical  role  in  the  performance  of  this  switching. 
Once  a  path  is  set  up,  it  is  tied  up  from  the  time  that  the  first  bit  of 
the  first  message  until  the  last  bit  of  the  last  message  is  transmitted 
between  the  two  parties  (this  includes  idle  times  as  well).  Circuit 
switching  is  a  common  method  for  telephone  switching  [SYSK  60];  when 
applied  to  data  communication  networks,  it  suffers  from  two  major  draw¬ 
backs:  slow  path  set-up  and  low  channel  utilization  (more  about  this  in 
Chapters  2  and  3).  This  type  of  switching  has  been  the  primary  technol¬ 
ogy  utilized  in  such  computer  networks  as  CYBERNET  [LUTH  72],  INFONET 
and  DATRAN  [CHEN  75A,B],  [RAND  75]. 


Message  Switching 


In  this  technique  a  message  is  defined  to  be  the  logical  unit 


of  information  from  the  viewpoint  of  transmission.  Each  message  carries 
with  it  information  regarding  its  destination.  This  is  done  by  attach¬ 
ing  addressing  bits  (usually  a  part  of  the  header)  to  each  message.  In 
this  switching  technique  only  one  channel  is  used  at  a  time  for  a  given 
transmission.  A  message  first  travels  from  its  source  node  to  the  next 
node  in  its  path,  and  when  the  entire  message  is  received  at  this  node, 
then  the  next  step  in  its  journey  is  started  by  transmitting  the  mes¬ 
sage  on  a  selected  channel  towards  the  destination;  if  the  selected 
channel  is  busy,  the  message  waits  in  a  queue,  and  finally,  when  the 
channel  becomes  free,  transmission  starts.  Thus  the  message  "hops"  from 
node  to  node  through  the  network,  using  one  channel  at  a  time,  possibly 
queueing  at  busy  channels  as  it  is  successively  "store-and-forwarded" 
through  the  network  [KLEI  64],  [KLEI  76B] .  Two  examples  of  networks 


which  use  this  type  of  switching  are  the  AUTODIN  [PAOL  75]  and  TYMNET 

[TYME  71],  [HARC  75]. 

* 

Packet  Switching  * 

This  switching  technique  is  basically  the  same  as  message 
switching,  except  that-  messages  are  subdivided  into  packets  with  a 
maximum  length,  prefaced  with  suitable  address  information.  Through 
decomposition  of  messages  into  packets,  many  packets  of  the  same  mes¬ 
sage  may  be  in  transmission  simultaneously,  and  a  message  can  use  a 
number  of  links  on  a  path  concurrently.  Due  to  more  complete  resource 
sharing  (here  the  resource  is  the  communication  link),  a  higher  channel 
utilization  and  lower  network  delay  is  possible  [BARA  64],  [DAVI  68], 
[ROBE  67],  [KLEI  76B] .  The  ARPANET  is  an  example  of  a  packet-switched 
network.  Other  examples  include  the  British  NPL,  the  French  CYCLADES, 
and  also  the  first  large-scale  commercial  packet -switched  network,  the 
TELENET. 

Analysis  of  the  tradeoffs  which  must  be  considered  in  evaluat¬ 
ing  the  three  switching  techniques,  and  the  decision  as  to  which  one  to 
use  is  a  difficult  task.  Some  tradeoff  studies  have  been  conducted 
[PORT  71],  [CLOS  72A] ,  [CLOS  72B] ,  [CLOW  73],  [ITOH  73],  [MIYA  75], 
[ROSN  76] ,  [KUKW  76A]  and  [KUMM  76B] ,  but  the  techniques  are  not  quite 
satisfactory  and  the  agreement  on  which  type  of  switching  is  superior 
is  not  universal.  One  thing  is  clear;  namely,  if  the  message  to  be 
transmitted  is  very  long,  circuit  switching  is  a  good  choice.  On  the 
other  hand,  if  the  messages  are  relatively  short,  it  pays  to  use  some 
sort  of  store -and -forward  facility.  The  fact  that  in  practice  there  is 
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a  broad  range  of  message  lengths  and  message  arrival  rates  motivates 
the  consideration  of  a  hybrid  mixture  of  store-and-forward  and  circuit 
switching  systems,  or  the  concept  of  integrated  networks  incorporating 
both  message  (or  packet) -switched  and  circuit-switched  capability.  A 
few  systems  based  on  integration  of  switching  systems  are  either  in 
existence  or  have  been  proposed  [ZAFI  76],  [PAOL  75],  [GERL  78];  how¬ 
ever,  no  system  based  on  a  hybrid  switching  technique  has  been  yet 
developed. 

1.2  Routing  and  Flow  Control  in  Store-and-Forward  Computer  Networks 

In  the  previous  section  we  considered  the  switching  alterna¬ 
tives  in  computer  networks.  In  many  (but  not  all)  cases  store-and- 
forward  switching  manifests  a  significant  advantage  over  circuit  switch¬ 
ing,  since  it  permits  simultaneous  support  of  multi-modal  traffic.  The 
success  of  the  first  packet  switched  network,  the  ARPANET,  has  generated 
great  enthusiasm  toward  adopting  this  type  of  switching. 

In  the  design  of  a  store-and-forward  network  one  faces  a  multi¬ 
tude  of  problems,  among  which  are:  topological  design;  capacity  allo¬ 
cation;  routing  procedure;  and  flow  control  procedure.  Several  differ- 

1 

ent  formulations  of  the  topological  design  and  capacity  allocation  prob¬ 
lem  can  be  found  in  the  literature  [KLEI  64],  [FRAN  70],  [FRAN  72], 

[GERL  73A],  [KLEI  76B] ,  [SCHW  77],  for  which  solutions  have  been 
proposed.  In  a  store-and-forward  network,  messages  (or  packets)  travel 
from  their  sources  to  their  destinations  "hopping"  from  one  node  to 
another.  Once  a  packet  (or  message)  enters  an  intermediate  node,  the 
next  node  to  which  it  should  be  sent  is  selected  according  to  a  well- 
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defined  decision  rule  referred  to  as  the  routing  procedure.  There  are 
a  variety  of  approaches  to  establishing  routes  for  messages  (or  pack¬ 
ets)  in  a  store-and-forward  network  [SCHW  77] :  the  routes  can  be  estab¬ 
lished  centrally  (as  in  TYMNET  and  SITA  [CHRE  73]),  or  locally  (as  in 
the  ARPANET);  they  can  be  fixed  and  deterministic,  chosen  on  the  basis 
of  average  traffic  statistics  (as  in  SITA) ;  they  can  be  fixed  during 
the  entire  message  interval,  although  varying  with  demand  (as  in  TYMNET, 
in  which  a  central  supervisory  program  establishes  a  route  -  fixed 
throughout  the  conversation  -  each  time  a  user  connects  into  the  system); 
or  they  can  be  adaptive,  following  a  stochastic  control  strategy  (as  in 
the  ARPANET) .  Several  classification  schemes  have  been  devised  to 
characterize  routing  policy  [KLEI  64],  [FULT  72],  [GERL  73B] ,  [MCCO  75], 
[MCQU  74],  [KAMO  76B] .  The  various  routing  methods,  whether  fixed  or 
adaptive,  local  or  central,  deterministic  or  stochastic,  attempt  to 
direct  messages  from  source  to  destination  such  that: 

i.  Delay  becomes  minimal  and 

ii.  Traffic  is  so  distributed  in  the  network  that  con¬ 
gestion  is  avoided. 

A  computer  network  is  a  collection  of  resources  (communica¬ 
tion  links,  etc.)  to  be  used  by  competing  users  (messages).  This  col¬ 
lection  of  resources  has  a  finite  capacity  which  causes  conflict  to 
occur  between  the  users  of  the  system.  These  conflicts  can  result  in  a 
degradation  of  performance  of  the  system  to  the  point  that  throughput 
goes  to  zero;  a  typical  behavior  of  "contention"  systems  [AGNE  74]. 

Even  with  the  best  routing  procedure,  if  the  network  becomes  overloaded, 
this  throughput  degradation  becomes  inevitable.  Networks  cannot  afford 
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to  accept  all  the  traffic  that  is  offered  to  them  without  some  control. 
There  must  be  rules  which  govern  the  acceptance  of  traffic  from  outside 
and  coordinate  the  flow  inside  the  network.  These  rules  are  commonly 
known  as  flow  control  procedures. 

The  set  of  rules  that  control  the  flow  of  messages  (once  they 
are  admitted)  inside  the  network  are  usually  referred  to  as  "local  con¬ 
trol",  as  opposed  to  "global  control",  which  is  the  set  of  control  pro¬ 
cedures  which  supervise  the  admission  of  messages  to  the  network.  Most 
commonly  used  global  flow  controls  can  be  divided  into  two  categories: 
end-to-end  flow  control,  used  in  the  ARPANET  [KAHN  7]],  [CERF  74];  and 
isarithmic  flow  control,  as  considered  in  the  NPL  network  [DAVI  71], 
[DAVI  73],  [PRIC  73].  End-to-end  flow  control  is  usually  described  in 
terms  of  "window"  mechanism  [CERF  74],  [BELS  75],  where  the  number  of 
"unacknowledged"  messages  (or  packets)  between  a  source  and  destination 
is  limited  to  the  window  size.  End-to-end  flow  control  is  accomplished 
by  inter-process  communication  protocols  and  any  attempt  to  quanti¬ 
tatively  study  the  former  should  start  with  the  development  of  an 
analytic  model  for  the  latter.  Except  for  simulation  studies  [PRIC  73], 
[GIES  76]  and  [LELA  76],  in  most  analytic  work  done  so  far  the  inter¬ 
action  of  different  flow  control  tools  has  not  been  studied  [PENN  74], 
[PENN  75],  [SUNS  75],  [CHAT  76],  [CHOU  76]  and  [FAYO  77]. 

1 . 3  Scope  of  this  Research  and  Outline  of  the  Dissertation 

In  the  previous  sections  we  examined  some  issues  in  computer- 
based  communication  networks  and  (intentionally)  elaborated  in  more 
detail  on  the  switching  techniques  and  flow  control  mechanisms  in  such 
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networks,  the  two  issues  to  which  this  dissertation  is  addressed. 

In  Section  1 . I  we  found  that  a  hybrid  mixture  of  circuit 
switching  and  store-and-forward  switching  may  be  a  good  alternative,  as 
it  can  handle  a  broad  range  of  message  lengths  and  arrival  rates.  We 
also  pointed  out  that  very  little  satisfactory  quantitative  work  has 
been  done  regarding  the  tradeoff  study  of  switching  systems.  In  this 
dissertation  we  introduce  and  analyze  a  new  hybrid  switching  method 
and  also  carry  out  a  tradeoff  study  of  switching  systems. 

It  has  been  realized  that  the  behavior  of  a  computer  network 
is  similar  to  a  "contention"  system  in  that  as  traffic  increases, 
throughput  of  the  system  degrades  (possibly  to  zero)  and  its  delay 
escalates  [RUDI  76].  Very  few  analytic  works  can  demonstrate  this 
phenomenon.  In  our  study  of  flow  control,  we  develop  analytic  models 
for  flow  control  based  on  window  mechanisms  which  can  predict  this 
behavior.  The  control  schemes  that  we  study  are  categorized  as  "static" 
and  "dynamic"  flow  controls.  In  static  flow  control,  parameters  of  the 
system  are  not  adjusted  to  the  stochastic  load  fluctuations  on  the  sys¬ 
tem.  We  study  the  optimal  window  size  for  static  flow  control  which 
results  in  the  maximum  throughput.  In  dynamic  flow  control,  parameters 
of  the  system  are  dynamically  adjusted  to  the  resources  available  in 
the  network.  Based  on  Markov  decision  theory,  an  optimal  policy  to 
dynamically  decide  on  the  window  size  is  formulated.  Because  an  exact 
solution  to  the  problem  is  laborious,  a  heuristic  solution  to  the  prob¬ 
lem  is  presented. 

The  issue  of  interconnection  of  computer  networks  (otherwise 
known  as  internetting)  has  recently  become  of  major  interest.  In  this 
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dissertation  we  study  some  problems  related  to  internetting.  Based  on 
our  study  of  static  flow  control,  we  develop  a  network  algebra  for 
series  and/or  parallel -connected  networks  and  also  investigate  the 
problem  of  resource  allocation  in  parallel -connected  networks.  Finally, 
we  analyze  a  number  of  buffer  allocation  schemes  in  nodes  where  a  number 
of  networks  merge  together.  These  allocation  schemes  serve  as  a  further 
tool  for  the  control  of  traffic  in  networks. 

In  general,  the  goal  of  this  dissertation  is  to  develop  ana¬ 
lytic  tools  for  the  performance  study  and  design  of  computer  networks. 

In  many  cases  the  models  which  we  use  are  rather  simplistic;  neverthe- 

% 

less,  they  help  us  understand  the  implications  of  real  system  behavior. 

Chapters  2  and  3  are  concerned  with  the  switching  problems  in 
computer  nets. 

In  Chapter  2  a  hybrid  switching  technique,  called  "virtual 
cut-through",  is  introduced  and  its  performance  is  compared  with  that 
of  message  switching.  The  delay  analysis  of  "virtual  cut-through"  is 
carried  out  for  two  cases:  noiseless  channels  and  noisy  channels.  It 
is  shown  that  in  most  cases,  the  network  delay  of  virtual  cut -through 
is  less  than  that  of  message  switching.  At  the  end  of  this  chapter  we 
study  an  optimal  design  problem  for  this  switching  technique. 

In  Chapter  3  we  carry  out  a  tradeoff  study  of  three  switching 
techniques:  virtual  cut-through,  message  switching,  and  circuit 
switching.  We  first  formulate  a  network  model  for  delay  in  a  circuit 
switched  system;  however,  no  simple  solution  to  the  transform  equation 
which  results  from  this  model  has  been  found.  Next,  we  generalize  the 
independence  assumption  [KLEI  64]  and  develop  an  analytic  model  for  the 


delay  of  a  communication  path  between  two  users  in  a  circuit  switching 
network.  Having  thus  developed  expressions  for  delay  in  circuit  switch¬ 
ing,  we  compare  the  performance  of  the  three  switching  techniques. 

While  our  study  confirms  the  previous  understandings  that  circuit 
switching  is  favorable  for  long  and  infrequent  message  traffic,  it  also 
shows  that  performance  of  circuit  switching  depends  very  much  on  the 
topology  of  the  network  and  also  on  the  number  of  channels  per  trunk. 

In  general,  for  long  paths,  if  the  number  of  channels  per  trunk  is 
properly  selected,  network  delay  for  circuit  switching  is  less  than 
message  switching  and  cut-through  switching.  Our  study  indicates  that 
the  selection  of  switching  technique  should  be  based  upon  a  careful 
study  of  parameters  of  the  network  and  characteristics  of  the  traffic. 

At  the  end  of  this  chapter  we  study  an  optimal  design  problem  based  on 
our  comparison  study  between  message  switching  and  circuit  switching. 

Chapters  4  and  5  are  concerned  with  the  flow  control  problems 
in  computer  networks. 

In  Chapter  4  we  develop  analytic  models  for  end-to-end  com¬ 
munication  protocols  and  study  the  window  mechanism  for  flow  control  in 
st ore -and -forward  computer-based  communication  networks.  We  start  with 
a  very  simple,  deterministic  model  in  which  there  are  no  stochastic 
fluctuations  in  the  load  on  the  system.  Then,  step  by  step,  we  improve 
this  basic  model  and  develop  a  static  flow  control  model  in  which  the 
parameters  of  the  system  are  not  dynamically  adjusted  to  the  stochastic 
fluctuations  of  the  system  load. 

In  Chapter  5  we  use  the  results  of  our  study  in  Chapter  4  to 
develop  a  dynamic  flow  control  in  which  window  size  and  certain  other 


parameters  of  a  network  are  dynamically  adjusted  to  the  stochastic 
fluctuations  in  the  system  load.  The  dynamic  decision  on  optimum  window 
size  is  formulated  as  a  Markov  decision  process.  Having  thus  formulated 
the  process,  we  develop  a  heuristic  solution  to  the  problem  as  an 
alternative  to  the  laborious  exact  solution.  Based  on  the  heuristic 
solution,  a  (sub)optimal  policy  to  decide  on  window  sizes  is  introduced 
and  its  performance  is  compared  with  the  optimal  one. 

In  Chapter  6  we  study  some  issues  related  to  the  interconnec¬ 
tion  of  computer  networks.  We  begin  by  establishing  a  network  algebra 
as  an  aid  in  the  systematic  study  of  computer  networks  interconnection; 
we  then  look  at  some  design  problems  for  allocation  of  resources  among 
a  number  of  parallel -connected  networks.  Finally  we  discuss  buffer 
allocation  strategies  in  nodes  where  a  number  of  networks  merge  together. 

Appendix  A  contains  a  list  of  often-used  symbols. 

In  Appendix  B  we  present  proofs  for  some  of  the  theorems  stated 
in  Chapter  2. 

In  Appendix  C  some  results  from  Markov  decision  theory  are 
presented.  These  results  are  used  in  Chapter  5. 

In  Appendix  D  we  study  various  buffer  sharing  and/or  allocation 
schemes  for  a  multiplexing  node  [KERM  77].  The  study  shows  that  in 
order  to  provide  a  fair  service  to  different  streams  of  data,  some  non¬ 
trivial  sharing  schemes  should  be  implemented.  Chapter  6  contains  per¬ 
formance  curves  for  different  sharing  schemes  presented  in  this  appendix. 
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CHAPTER  2 


"VIRTUAL  CUT-THROUGH" 

A  NEW  SWITCHING  TECHNIQUE 

One  of  the  basic  problems  in  a  computer  communication  network 
design  is  to  select  the  switching  method.  Early  computer  networks  were 
built  either  over  the  existing  telephone  networks  or  with  leased  lines, 
using  principles  of  telephone  switching,  namely  line  (circuit)  switching. 
With  circuit  switching,  a  complete  path  of  communication  links  must  be 
set  up  between  two  parties  before  the  real  communication  begins.  This 
setup  is  accomplished  via  a  signalling  message.  The  path  is  tied  up 
from  the  time  the  first  bit  of  the  first  message  until  the  last  bit  of 
the  last  message  is  transmitted  between  the  two  parties.  This  includes 
any  idle  periods  during  which  the  two  parties  are  silent.  Once  a  path 
is  set  up,  no  further  signalling  for  addressing  purposes  is  necessary; 
thus,  in  a  circuit-switched  network,  a  path,  once  set  up,  essentially 
provides  all  the  addressing  information. 

When  applied  to  data  communication  networks,  circuit  switching 
suffers  from  some  drawbacks.  One  is  the  slow-call  setup  which  delays 
the  transfer  of  messages  from  sender  to  receiver.  The  fact  is,  circuit 
switching  is  essentially  used  in  telephone  networks  which  have  been 
designed  for  human  communication.  A  telephone  conversation  is  typically 
an  order  of  magnitude  longer  than  the  signalling  delay.  On  the  other 
hand,  measurement  studies  [JACK  69,  FUCH  70]  conducted  on  time-sharing 
systems  indicate  that  in  computer  communication,  data  streams  are  bursty. 
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That  is,  they  consist  of  frequent  short  messages  with  sporadic  long  ones. 
The  excessive  signalling  delay,  especially  for  short  messages,  is  a 
serious  disadvantage  of  this  switching  technique.  Another  drawback  is 
that  channel  utilization  may  be  low  due  to  the  fact  that  the  channels  on 
a  path  are  tied  up  but  actually  are  not  being  used  during  the  idle 
periods.  Furthermore,  as  we  will  see  in  Chapter  3,  the  communication 
network  saturates  very  fast  due  to  excessive  holding  time  of  channels. 
Fig.  2.1-a  shows  the  network  delay  in  a  circuit-switched  system.  It 
is  assumed  that  there  is  no  interfering  traffic  and  that  the  number  of 
intermediate  nodes  in  the  path  is  two. 

In  order  to  achieve  a  better  channel  utilization,  one  may 
think  of  relinquishing  the  channels  on  a  path  during  periods  in  which 
the  parties  are  silent.  This  brings  us  the  idea  of  (store-and- forward) 
message  switching.  In  this  method,  messages  are  routed  toward  their 
destination  node  without  establishing  a  path  beforehand.  Through  pro¬ 
vision  of  a  storage  facility  at  each  node,  messages  are  stored  in 
intermediate  nodes  and  then  are  sent  forward  to  a  selected  adjacent 
node  (hence  the  name,  store-and- forward) .  This  selection  is  made  by  a 
well-defined  decision  rule  referred  to  as  the  routing  algorithm.  The 
process  is  repeated  until  the  message  reaches  the  destination  node. 

Each  message  carries  with  it  the  information  regarding  its  destination. 
This  is  done  by  attaching  addressing  bits  (usually  a  part  of  the  header) 
to  each  message.  In  this  switching  method,  by  not  allocating  the  com¬ 
munication  links  into  complete  paths  for  specific  source-destination 
pairs  of  nodes,  each  link  is  statistically  shared  by  many  messages. 

Fig.  2.1-b  shows  the  network  delay  in  a  message- switching  system. 
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Fig.  2.1  Operations  of  Different  Switching  Systems . 
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Examination  of  Fig.  2.1-b  suggests  that  a  better  utilization 
and  further  reduction  in  network  delay  is  possible  by  dividing  a  message 
into  smaller  pieces  called  packets.  This  method  is  called  packet  switch¬ 
ing  and  is  shown  in  Fig.  2.1-c.  In  packet  switching,  each  packet  (rather 
than  each  message)  carries  addressing  information.  This  introduces  extra 
overhead;  however  by  dividing  messages  into  packets  a  message  can  use  a 
number  of  links  on  a  path  simultaneously,  and  due  to  a  more  complete 
resource  sharing  (here  the  resource  is  the  communication  links),  a 
higher  channel  utilization  and  lower  net  delay  is  possible. 

Reflection  on  Figs.  2.1-b  and  2.1-c  shows  that  the  extra  delay 
in  message  switching  is  caused  by  the  fact  that  we  do  not  permit  a  mes¬ 
sage  to  be  transmitted  out  of  a  node  before  it  is  received  completely. 

We  notice  that  when  a  message  arrives  in  an  intermediate  node  and  its 
outgoing  channel  is  free,  it  actually  does  not  need  to  be  completely 
received  in  the  node  before  being  transmitted  out.  In  fact  it  can  be 
transmitted  out  immediately  after  the  outgoing  channel  is  recognized, 
i.e.,  after  the  reception  of  its  header  and  the  selection  of  an  outgoing 
channel  by  the  routing  procedure.  That  is,  we  only  buffer  when  we  must 
(due  to  a  busy  channel)  but  avoid  the  delay  due  to  unnecessary  buffering 
in  front  of  an  idle  channel.  This  is  essentially  a  hybrid  mixture  of 
circuit-switching  and  packet-switching  techniques.  Fig.  2.1-d  shows 
the  delay  when  this  "virtual  cut-through"  method  is  used. 

If  the  packet  encounters  busy  channels  at  all  of  the  inter¬ 
mediate  nodes,  the  outcome  is  exactly  the  same  as  a  packet-switched 
network.  On  the  other  hand,  if  all  of  the  intermediate  channels  are 
free,  the  outcome  is  similar  to  a  circuit-switched  system.  In  general, 
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the  delay  of  the  cut-through  system  is  somewhere  between  packet-switched 
and  circuit-switched  systems,  and  as  Fig.  2.1-d  shows,  this  delay  is 
less  for  smaller  packets. 

In  this  chapter  we  will  study  and  analyze  the  performance  of 
this  switching  technique  and  compare  it  with  the  packet-switching  method. 
We  start  with  some  assumptions  and  terminology. 

2.1  Assumptions 

In  the  opening  of  this  chapter  we  discussed  message-  and  packet¬ 
switching  systems.  The  essential  difference  between  these  two  systems 
is  that  in  packet  switching  a  message  consists  of  multiple  packets.  In 
this  chapter  we  assume  that  messages  consist  of  single  packets  and  we  no 
longer  differentiate  between  these  two  systems. 

Consider  a  "tagged"  message  in  a  message-switching  communication 
net.  Whenever  the  message  arrives  in  an  intermediate  node  in  which  the 
outgoing  channel  is  free,  we  say  that  the  message  arrives  in  a  "free" 
node  (this  happens  with  probability  (1  -  P  ),  P^  being  the  probability  of 
having  to  wait).  Furthermore,  because  the  message  can  be  sent  out  imme¬ 
diately  after  its  header  is  received  (rather  than  the  entire  message), 
we  say  a  "cut -through"  has  occurred.  If  after  the  reception  of  the 
header,  the  message  cannot  be  sent  out  immediately  (due  to  busy  channels) 
then  it  must  be  received  completely  before  being  transmitted  out  (i.e., 
partial  cuts  are  not  allowed).  The  reason  for  this  assumption  is  two¬ 
fold.  First,  in  the  case  of  noisy  channels  the  message  can  be  error- 
checked  when  it  is  blocked  (more  about  this  in  Section  2.3.2),  and 
second,  this  assumption  makes  the  analysis  simpler. 


19 


— 


The  following  assumptions  are  accepted  throughout  the  analysis: 

1.  External  Poisson  arrivals. 

2.  Exponentially  distributed  message  lengths. 

3.  Infinite  nodal  capacity. 

4.  Deterministic  routing. 

5.  Independence  assumption  [KLEI  64]. 

6.  Balanced  network  (defined  below). 

7.  Negligible  propagation  delay. 

Definition  2.1 

A  network  is  said  to  be  balanced  if  the  utilization  factor 
of  all  of  its  channels  is  the  same.  Note  that  this  property  does 
not  imply  any  assumption  on  the  topology  of  the  network. 

A  detailed  discussion  and  justification  of  the  above  assump¬ 
tions  can  be  found  in  [KLEI  64],  [GERL  73A]  and  [KAMO  76B] . 

The  analysis  is  carried  out  for  multiple  channel  links.  We  use 
the  word  link  for  the  communication  media  between  nodes.  Each  link  can 
be  split  into  one  or  more  channels. 

Appendix  A  contains  a  list  of  notation  used  in  this  chapter. 

2.2  Analysis  of  the  Number  of  Cut-Throughs 

Although  the  average  number  of  cut-throughs  is  not  a  good 
measure  of  performance  it  is  used  to  evaluate  the  network  delay.  In 
this  section  we  investigate  some  of  its  properties. 

Theorem  2 . 1 

The  average  number  of  cut-throughs  in  a  balanced  network  with 
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channel  utilization  p  is  given  by 

nc  =  (nh-  1)  (1  -  Pw)  (2.1) 

where  nc  is  the  average  number  of  cut-throughs ,  n^  is  the  average  num¬ 
ber  of  hops  (or  path  length),  and  Pw  is  the  probability  of  waiting 
(no  cut) . 


Proof: 

Whenever  a  message  enters  a  free  node  it  can  make  a  cut..  This 
event  occurs  with  probability  1  -  P  .  Due  to  the  independence  assumption, 
the  number  of  cuts  has  a  binomial  distribution  [FELL  50]  and  we  have: 

Pr[nc  =  k  |  nh  =  n,  p]  =  (n  "  X) (1  -  PjV'^1  0<k<n-l 


The  conditional  generating  function  of  the  number  of  cut-throughs  may  be 
written  as 


C  (z)  =  E [z 
n.p'-  1 


n. 


n  ;  1 )  (i  -  p  )kpn*k'1 
k  /  v  w'  w 


C  (z)  =  [P  ♦  z(l - P  )]n_1 
n,p  '  1  w  v  w'J 

We  may  now  find  the  conditional  mean  number  of  cut-throughs: 

_ dC  (z)  | 

VE["'CI  Vn-  pl  ^ -  I  =(n-l)(l-Pw) 

z=  1 

\  =  (n  -  1)  Cl  -  Pw) 

The  mean  number  of  cut-throughs  is  therefore 


nc  =  E  Etfic  I  fih  =  n*  P]prtfih  =  nl  ■  O^-lKl-P,,) 

n 
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and  so 


nc 


Q.E.D. 


The  unconditional  generating  function  of  the  number  of  cut- 
throughs  is  derived  below: 

Cp(2)  =  £  Cn,p(z)PrlVn]  =  E  Ipw  +  z(l-Pw)ln-1Pr[nh  -  n] 


or 


CP(Z)  =  Pw  ♦  2(1  *  P  ) 


(2.2) 


w  '  w 


A  h 

where  N(z)  =  E[z  n]  is  the  generating  function  of  the  number  of  hops. 


Notice  that 


n  = 


dCp(z) 


,(1) 


c  dz 


z=l 


-  OLlULsJUlL  (1 .  P  ) 

1  w 


but 


so 


n  =  (n,  -  1)(1  -  P  ) 
c  v  h  ‘ K  wJ 

as  we  had  before. 

For  the  special  case  of  single  channel  links,  Eqs.  (2.1)  and 
(2.2)  are  reduced  to 


n  =  (n.  -  1)  (1  -  p) 


c  (z)  =  Me*  z.(i  -  p.)J 

PL  J  P  +  z(l-p) 


which  were  first  reported  in  [KLEI  76A] . 


(2.3) 

(2.4) 
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These  equations  show  that  when  the  average  number  of  hops  (n^) 
increases,  so  does  the  average  number  of  cut-throughs.  This  is  intui¬ 
tively  clear  when  we  notice  that  as  the  number  of  intermediate  nodes 
increases  there  is  a  higher  chance  of  experiencing  more  cuts.  They  also 
show  that  nc  is  a  decreasing  function  of  p  which  means  that  in  a  less 
crowded  network  the  average  number  of  cuts  is  larger. 

Special  Cases 

(I)  P  =  1  This  implies  Pw  =  1  and  n£ =  0,  i.e,  no  cut-throughs 
are  made. 

(II)  p  =  0  This  implies  P  =  0  and  n  =  n,  -  1. 

r  w  c  h 

In  case  (I),  with  probability  one,  all  of  the  intermediate 
nodes  are  busy  upon  arrival  of  the  message.  The  network  then  behaves 
like  a  pure  message-switched  system.  In  (II),  with  probability  one, 
all  of  the  intermediate  nodes  are  free  and  all  of  them  are  cut  through; 
thus  it  resembles  a  circuit-switched  system.  Later  in  this  chapter  we 
will  show  performance  curves  for  the  system. 


2.3  Delay  Analysis 

An  important  performance  measure  for  a  computer  communication 
net  is  the  average  source  to  destination  message  delay  T,  defined  below: 


T  =  £  liiz.. 

ft  Y  1J 

where  y„  is  the  average  number  of  messages  entering  the  network  per 
second  with  origin  i  and  destination  j,  y  is  the  total  arrival  rate  of 
messages  from  external  sources,  and  is  the  average  message  delay  for 
messages  with  origin  i  and  destination  j . 
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For  a  message -switched  system,  a  straightforward  application 
of  Little’s  result  [L1TT  61]  to  the  queueing  model  leads  to  the  fol¬ 
lowing  expression  for  T  [KLEI  64] .  (Here  we  use  the  symbol  to 
indicate  that  this  delay  is  for  message- switching  systems.) 


where  T\  is  the  average  delay  at  node  i. 

Calculation  of  T\  is  in  general  a  nontrivial  and  currently 
unachieved  task;  however,  by  accepting  the  assumptions  of  Section  2.1, 
especially  the  independence  assumption  of  Kleinrock  [KLEI  64],  we  are  in 
a  position  to  reduce  the  network  of  queues  model  to  the  model  first 
studied  by  Jackson  [JACK  57].  By  virtue  of  his  results,  each  node 
behaves  stochastically  independent  of  the  other  nodes  and  similar  to 
an  M/M/m  system,  for  which  the  average  delay  is  reported  in  Appendix 
B.l.  For  the  special  case  of  uniform  p,  all  of  the  nodal  delays  are 
identical  and  we  have 


but  we  have  [KLEI  64] 


so  we  get 


T  =  n.T. 
m  hi 


(2.5) 


So  far  we  have  not  made  any  assumption  regarding  the  channel 
error  rate.  We  will  initially  give  an  analysis  of  delay  for  a  network 
with  noiseless  channels.  It  turns  out  that  for  such  a  system  the 


analysis  is  fairly  simple  and  a  closed  form  expression  for  network  delay 
is  obtainable.  However,  when  the  network  contains  noisy  channels,  as  we 
will  see  shortly,  the  network  delay  can  be  calculated  through  a  fairly 
complicated  iterative  routine.  We  will  present  a  delay  analysis  of  these 
two  systems;  however,  later  we  deal  only  with  networks  with  noiseless 
channels. 
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where  tQ  ( =  1/yC)  is  the  average  transmission  time  of  a  message  on  a 

channel,  t.  is  the  average  transmission  time  of  a  header  and  a  =  t,/t„. 

“  h  0 

PToof : 

For  each  node  at  which  a  cut -through  is  made,  a  nodal  service 
time  is  saved.  However,  this  service  time  is  conditioned  on  the  event 
that  the  waiting  time  is  zero.  So  we  have 

T  -  T  =  n  E[s  I  w  =  0] 
m  c  c  1  1  1 

where  n£  is  the  average  number  of  cuts,  s  is  the  total  delay  in  a  node 
and  w  is  the  waiting  time  in  a  node. 

However,  E[s  |  w  =  0]  =  tQ,  so 

T  -  T  =  n  t_ 
ro  c  c  0 

If  we  assume  that  a  message  can  be  sent  out  only  after  its 
header  is  received,  then  the  previous  equation  is  changed  to 

m  c  cv  0  h 

where  (tQ  -  th)  is  the  saving  in  delay  at  each  node  when  the  cut-through 
method  is  used.  Using  the  value  of  n£  from  Eq.  (2.1), we  get 

Tc  ■  T„  -  <V  1)0  -  P„H1  -  a)t0 

Q.E.D. 

n,  t  _ 

For  the  special  case  of  N  ,  =  1  and  a =  0,  we  have  T  *  -2-^- 

ch  m  1  -  p 

an<*  ^w3*3'  ant*  (2.7)  is  reduced  to 

Tc*rrf-  (Sfc-oa-pjt,,  (2.8) 

From  Eq.  (2.7)  we  have 
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T  -  T 
m  c 


•i: 


(nh-  1)  (I  -  a)tQ 


for  p  =  0 
for  p  =  1 

which  shows  that  if  all  of  the  intermediate  nodes  are  busy  (p  =  1  and 


(2.9) 


P  =  1),  no  reduction  in  delay  is  made.  On  the  other  hand,  when  p*  0 
w 

and  there  is  no  intermediate  busy  node,  the  transmission  times  of  the 
intermediate  channels  are  saved.  This  fact  becomes  clearer  when  we 
set  a  =  0. 


For  a=  1,  Eq.  (2.7)  gives  T£ =  T^;  this  is  intuitively  clear 
because  a=  1  implies  that  a  message  is  completely  composed  of  its  header 
and  the  cut-through  system  is  identical  to  message  switching. 


2.3.2  Delay  Analysis  for  a  Network  with  Noisy  Channels 

A  detailed  analysis  of  a  network  with  noisy  channels  can  be 
very  complicated.  The  difficulty  is  that  in  a  network  with  unreliable 
channels  there  should  be  acknowledgement  and  time-out  mechanisms  to 
guarantee  correct  delivery  of  messages;  modeling  of  such  mechanisms  is 
usually  an  involved  task  [KAMO  76B] ,  [LAM  75A] .  In  a  later  chapter  of 
this  dissertation,  on  dealing  with  flow  control  techniques,  we  will  pre¬ 
sent  a  detailed  model  of  an  acknowledgement  mechanism;  however  in  this 
section  we  accept  a  very  simplified  model  for  this  mechanism.  For  mes¬ 
sage  switching  we  assume  if  a  message  is  received  with  some  bits  in  error 
(which  happens  with  a  constant  probability  P  ) ,  it  is  retransmitted 
immediately  (i.e.,  instantaneous  negative  and  positive  acknowledgement). 
We  further  assume  that  with  the  retransmission  traffic,  the  network  is 
still  balanced  (Definition  2.1).  A  logical  picture  of  each  node  is  given 
in  Fig.  2.2. 
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If  the  average  delay  for  a  one-pass  transmission  is  s,  the 
total  nodal  delay  becomes 

-  T  N  u  P 

x  s  — 5 —  =  -ft.  + - * _  /  (i  .p  ) 

i  1  -  PA  pc  yC(l-p)  eJ 

and  the  network  delay  for  message  switching  will  be 

T»  *  [~gr  4  iiC('l^be)  ]  "h  /(1  •  Pe)  t21t» 

In  the  above  expression  pg  is  the  effective  network  traffic.  The  useful 
traffic  is  given  by 

P-(l-Pe)Pe  (2.11) 

For  the  cut-through  system  the  operation  is  somewhat  more 
involved.  We  assume  that  a  message  is  error-checked  only  in  its  desti¬ 
nation  node  or  any  intermediate  node  which  it  can  not  cut  through.  (We 
call  this  type  of  node  a  "final  node”  which  is  not  necessarily  the  same 
as  the  destination  node,  although  the  destination  node  is  always  a  final 
node) .  It  is  not  that  a  message  could  not  be  checked  for  error  in  an 
intermediate  node  which  it  cuts  through,  but  even  if  an  error  were 
detected  (this  can  be  done  only  after  the  last  bit  of  a  message  arrives 
in  a  node)  there  is  no  way  to  stop  the  transmission  and  notify  the  next 
node(s)  not  to  accept  the  message  because  the  message  has  already  begun 
to  be  transmitted  out  of  the  node.  In  a  final  node  error  checking  can 
be  done  because  the  message  must  be  received  completely.  If  it  is 
recognized  that  the  message  is  erroneous,  it  has  to  be  retransmitted 
through  all  of  the  intermediate  nodes  (including  the  first  node  on  the 
cut-through  path).  As  before,  we  further  assume  that  acknowledgements 
(positive  and/or  negative)  are  instantaneous.  Figure  2.3  makes  the 
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operations  more  clear.  In  this  figure,  a  message  is  to  be  transmitted 
from  source  node  S  to  destination  node  D.  The  message  has  already  made 
its  way  to  an  intermediate  node  i  (in  which  it  has  been  blocked) .  When 
it  starts  transmission  from  this  node,  it  makes  a  cut  through  nodes  i  +  1, 
i  +  2  and  i  +  3  and  is  again  blocked  at  node  i  +  4.  At  this  node,  after 
it  is  checked,  the  message  is  found  to  have  some  bits  in  error.  Through 
negative  acknowledgement  (which  we  assume  is  instantaneous) ,  it  is 
retransmitted  from  node  i.  In  the  retransmission  process,  however,  it 
can  make  a  cut  through  node  i+1  only  and  is  blocked  at  node  i  +  2. 
Depending  on  whether  it  is  received  incorrectly  or  not,  further  retrans¬ 
mission  from  node  i  is  necessary  or  it  can  start  making  its  way  toward 
its  destination  node  D  (after  queueing  delay). 

The  delay  analysis  for  the  cut-through  system  in  a  network  with 
noisy  channels  is  fairly  complicated,  and  in  fact  no  closed  form  expres¬ 
sion  for  it  could  be  derived.  The  conditional  delay,  when  the  number  of 
hops  is  constant  (n^n),  is  given  below. 

Theorem  2.3 

The  average  network  delay  for  the  cut-through  system  in  a  net¬ 
work  with  noisy  channels  is  given  by 

T  =  T(n)(n) 
c  c  v 

where 
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and 

{(l-Pw)kPw  (0<k<j-l  and  j  >1) 

(1-Pw)k  (k  =  j  -  1) 

where  pg  is  the  effective  traffic  and  n  is  the  path  length.  (See 
Appendix  A  for  the  description  of  notations.) 

Proof :  Appendix  B.2 

If  we  set  P  =  0,  after  some  algebra  Eq.  (2.12)  reduces  to 
c 

Eq.  (2.7),  the  delay  for  the  cut-through  system  in  a  network  with  noise¬ 
less  channels. 
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The  useful  traffic  is  related  to  effective  traffic  according  to 


the  following  theorem 


Theorem  2.4 


For  the  cut-through  system  in  a  balanced  network  with  noisy 


channels,  the  useful  traffic  is  determined  according  to  the  following 


relationships 


where 


where  o  is  effective  channel  traffic  and  n  is  the  path  length 


Proof :  Appendix  B.3 


Some  special  cases  are  of  interest 


If  P  +1,  then  Eqs.  (2.10)  and  (2.12)  become  identical.  This 


is  intuitively  clear  as  in  this  case  n  =  0  and  retransmissions 


take  place  only  over  one  hop  in  both  systems. 

If  n  =  1,  again  Eqs.  (2.10)  and  (2.12)  become  identical 


Clearly  in  this  case  there  is  no  retransmission  and  effective 
traffic  is  the  same  as  useful  traffic. 


In  Section  2.S  we  will  present  some  performance  curves  and  will 
see  the  effect  of  channel  errors  on  network  performance.  In  the  remain¬ 
ing  sections  of  this  chapter  we  will  assume  that  network  channels  are 
noiseless. 


2.4  Traffic  Gain 

Analysis  of  Eq.  (2.7)  shows  that 

Tm(p)  '  TcCp)  -  °  0  1  P  1  1 

(Here  we  use  T^fp)  and  Tc(p)  to  indicate  the  delay  at  a  certain  traffic.) 

Later  in  this  chapter  we  present  some  performance  curves  which 
explicitly  show  this  fact.  This  property  indicates  that  at  the  same 
traffic  level  the  network  delay  in  the  cut-through  system  is  less  than 
in  message  switching.  Equivalently,  for  the  same  network  delay  the  cut- 
through  system  can  handle  more  traffic;  see  the  sketch.  Fig.  2.4.  It  is 
the  purpose  of  this  section  to  calculate  this  throughput  increase. 
(Actually,  what  we  find  is  the  difference  between  channel  utilization 
of  the  two  systems  at  the  same  delay;  however,  this  gives  us  a  measure 
of  traffic  gain.) 

If  we  set  Tm(pi)  =  T£(p2)  we  get 


rN  . 
ch 

P 

w  1 

=  n'h 

rN  u 

ch 

P 

w2  . 

PC 

PC(1  -  Pl) 

.  mc 

UC(1  -  p2) . 

-  ("h-'H1-  W  (2'14) 

where  P  is  the  waiting  probability  at  a  node  at  which  the  utilization 
i 

factor  is  p^  (recall  that  we  are  assuming  channels  are  noiseless). 
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For  >  1,  is  a  complicated  expression  of  p  and  (see 
Appendix  B.l),  and  it  is  not  possible  to  find  P2  in  terms  of  Pi  explic¬ 
itly.  For  this  reason  we  solve  Eq.  (2.14)  analytically  only  for  the 
case  where  N  ^  =  1.  For  higher  values  of  numerical  techniques  are 


necessary. 


For  N  =  1  we  have  P  =  p  and  Eq.  (2.14)  is  reduced  to 
ch  w  K  n 


-  l/yc  -  l/yc 

h  1  -  Pi  h  1  -  P2 


(nh-  l)(l-p2)(l-o) 


1  -  Pi  - 


_ 1  -  P2 _ 

1  -  n(l  -  pz)2(l  -a) 


where  n  =  (n^  -  l)/n^  »  0  <_  n  <_  1. 

from  the  previous  equation  we  obtain 


p2  -  n(l  -  a)(l  -  p2) 

Pi - T 

1  -  n(l  -  a)  (1  -  p2) 


(2.15a) 


2nd  -ct)(l  -  Pi)  »  1  -  VI  ♦  4q(l-q)(l-p1; 
2n(l  -  a) (1  -  Pi) 


(2.15b) 


1  +  2nd  -  a)  (1  -  Pi)  -  vi  +  4n(l-a)(l 
2nd  -  cO(i  -  Pi) 


(2.16) 


When  nh  =  1  (n  =  0) ,  then  p2  =  pj;  i.e.,  there  is  no  gain. 

This  is  the  case  for  a  fully  connected  net.  For  n^  f  00  (n  =  1)  we  have 


n  .  _  1  ♦  2(1  -  «)  (1  -  p 1 ) 2  -  M  ♦  4(1  -  a) (1  -  Pi)2 

p2  -  pi -  2 ( 1  -  a)  ( T -  p75  (2-17) 
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Notice  that  when  +  °°,  Tm  and  Tc  both  grow  to  infinity;  however 
in  the  limit  there  is  still  a  gain  in  using  the  cut-through  system. 

2.5  Some  Performance  Curves  and  Simulation  Results 

Figure  2.5  shows  the  number  of  cuts  as  a  function  of  p,  the 

channel  utilization.  The  network  configuration  and  traffic  pattern  is 

shown  in  Fig.  2.6.  For  a  single  channel  system  n£  is  a  linear  function 

of  p.  As  the  number  of  channels  increases,  the  curve  tends  to  take  a 

step  shape  and  in  the  limit  it  becomes  a  step  function.  This  figure 

also  shows  some  simulation  results  for  N  ,  =  1,  2  and  4.  The  statistics 

ch 

are  from  the  model  shown  in  Fig.  2.6.  It  is  a  tandem  queue  in  which, 
except  for  the  first  node  and  the  last  ones,  the  traffic  goes  through 
three  hops  (2  intermediate  nodes).  The  reason  for  selecting  this  top¬ 
ology  was  to  create  a  uniform  traffic.  Although  we  did  not  use  the 
independence  assumption  in  the  simulation  and  the  message  lengths  were 
constant,  the  match  between  the  simulated  and  the  analytic  results  is 
very  encouraging.  ^ 

'  ’Sfl  y 

Figures  2.7-a  through  2.7-f  sholji  the  normalized  delay  curves 

(T  pC  and  T  pC)  as  a  function  of  useful  channel  traffic.  Notice  that 
m  c 

throughput  is  not  necessarily  equal  to  the  effective  channel  traffic. 

They  are  related  to  each  other  according  to  Eqs.  (2.11)  and  (2.13). 

The  curves  are  for  a  number  of  hops  equal  to  3  and  different  values  of 
a(th/t0)  and  Pg  (channel  error  rate).  The  effect  of  header  length  can 
be  seen  by  comparing  Figs.  2.7-a  with  2.7-b,  2.7-c  with  2.7-d  or  2.7-e 
with  2.7-f.  It  is  seen  that  larger  header  lengths  degrade  performance  of 
the  cut-through  system.  The  effect  of  channel  error  rate  can  also  be 
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UTILIZATION 


Fig.  2.5  Average  Number  of  Cut-Throught. 


NORMALIZED  DELAY  (T/uC) 


NORMALIZED  DELAY  (TpiC) 


NORMALIZED  THROUGHPUT  (X/^C) 


Fig.  2.7-b  May  V*.  Throughput 
BL-3.P  -ao.a-.i 


NORMALIZED  DELAY  (T/iC) 


Fig.  2.7-c  Delay  V*.  Throughput 
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NORMALIZED  THROUGHPUT  (X/*iC) 


Flff.  2.7-e  May  Vt.  Throughput 

nh-3.Pt-°.05.°-0.0 


NORMALIZED  DELAY  (TfiC) 


Fig.  2.7-f  Dalay  V*.  Throughput 
nK-  3,  P.  -  .05.  a  -  0.1 


seen  by  comparing  these  six  graphs.  Although  channel  error  rate  affects 
performance  of  both  message-switching  and  cut-through  systems,  it  is 
interesting  to  note  that  this  parameter  has  a  more  serious  effect  in  the 
cut-through  system  than  in  the  message  switching  system.  In  fact  for  a 
very  noisy  channel  (large  P  )  performance  of  message  switching  may 
become  better  than  the  cut-through  systems  (for  the  parameter  of  these 
graphs  this  happens  approximately  when  >  0.4).  The  reason  for  this 
phenomenon  is  that  in  the  cut-through  system  if  a  retransmission  becomes 
necessary,  it  must  take  place  over  more  than  one  hop.  Eq.  (2.13)  makes 
this  fact  clear.  Fig.  2.7-g  shows  a  somehow  interesting  phenomenon;  here 
the  channel  error  rate  is  (unreasonably)  high,  P  =  0.45.  When  traffic 
is  low  the  chance  of  making  cut-throughs  is  high  but  the  received 
messages  have  higher  chances  of  having  bits  in  error,  thus  the  retrans¬ 
mission  overhead  and  hence  the  delay  is  high.  As  traffic  increases,  the 
cut-through  properties  are  less  effective,  but  since  that  saves  unneces¬ 
sary  retransmissions  the  network  delay  decreases.  Further  increase  in 
traffic  results  in  higher  delay;  however,  this  time  the  queueing  delay 
is  responsible  for  the  increase  in  delay. 

Perhaps  a  better  comparison  is  to  find  the  ratio  Tc/Tm 


(l-Pw)(l-a)Nch 

Nch  *  V'1  '  p) 


(2.18) 


This  we  have  done  in  Figs.  2.8-a  through  2.8-d. 

As  we  see  from  Fig.  2.8-a  when  a  =  0  and  p-  0  then  T  is  n. 

m  n 

times  larger  than  T  .  This  is  intuitively  clear  as  in  the  cut-through 
system,  messages  can  snake  through  all  of  the  intermediate  nodes  with¬ 
out  experiencing  any  delay  (recall  that  a=0);  however,  in  message 
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NORMALIZED  DELAY  (T^C) 


Fig.  2.7-g  (May  V*.  Throughput 

i^,"  3,  ■  0.46,  a*  0.1 
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Fig.  2.8-c  Ratio  of  tha  (May  in  tha  Cut- Through  Syttam 
to  tha  Dalay  in  tha  Mauaga  Switching  Synam. 


switching  the  delay  incurred  at  each  intermediate  node  is  at  least  t^, 
the  transmission  time  of  a  message  over  a  single  channel. 


Some  Limiting  Cases 


Let  us  reflect  on  the  limiting  value  of  T  /T  as  the  variables 

cm 

reach  their  limits  (0£a<^l  and  l^N^). 

When  n^ =  1  (e-g-,  in  a  fully  connected  net),  then,  as  we  expect, 

T  /T  =1  for  all  possible  values  of  p,  the  utilization.  Fig.  2.8-a 
cm 

illustrates  this  fact.  For  n,  t 00  the  ratio  becomes 

n 


Tc  ^  (l-Pw)(l-a)Nch 
Tm  "  "  Nch  +  V*1  - 


(2.19) 


The  interesting  point  is  that  even  though  T£  and  both  become 

unbounded  as  n,  +  00  their  ratio  has  a  well-defined  limit.  In  fact, 
n 

Eq.  (2.18)  shows  that  Tc/Tm  is  a  decreasing  function  of  n^. 

Looking  at  the  dependency  of  on  a,  Eq.  (2.18)  shows  that 

smaller  values  of  a  result  in  a  better  performance.  In  fact  for  a=  1, 
both  systems  behave  similarly.  This  behavior  is  shown  in  Fig.  2.8-b. 

The  dependency  of  Tc/Tm  on  N  ^  is  shown  in  Figs.  2.8-c  and 
2.8-d.  It  shows  the  intuitive  behavior  that  larger  values  of  N  ^ 
result  in  a  lower  value  of  T^/T^.  As  for  the  dependency  of  N^,  Tc 
and  T^  both  grow  as  grows;  however,  their  ratio  is  not  unbounded. 

Some  simulation  results  are  shown  in  Fig.  2.9.  The  simulated 
network  consists  of  the  tandem  queues  shown  in  Fig.  2.6.  The  discrep¬ 
ancy  between  the  simulated  and  the  calculated  values  of  T  and  T  is 

cm 

the  result  of  the  topology  of  the  network  and  of  the  message  length 


distribution.  The  analytic  results  are  for  exponentially  distributed 
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0.2  0.4  0.6 

NORMALIZED  THROUGHPUT  (A/m  C) 


Fig.  2-9  Som*  Simulation  RmuIu. 


message  lengths,  whereas  in  the  simulation  program  we  have  used  fixed 
length  messages. 

The  traffic  gain  is  shown  in  Figs.  2.10  and  2.11.  The  curves 
are  for  single  channel  lengths  (Eqs.  (2.15)  and  (2.16)).  For  higher 
values  of  N  ^  Eq.  (2.14)  does  not  lend  itself  to  a  nice  mathematical 
solution  for  p2  in  terms  of  pj  and  we  have  not  plotted  any  curve  for 
>  1;  however  the  same  behavior  is  expected. 

Fig.  2.10  shows  the  value  of  p2  in  term  pi,  whereas  Fig.  2.11 
shows  the  traffic  gain  (p2  -  Pi)  as  a  function  of  Pi.  Again,  these  curves 
show  that  higher  values  of  n^  or  smaller  values  of  a  result  in  a  better 
gain.  For  =  1  or  a  =  1  both  systems  behave  identically. 

2.6  Storage  Requirement 

In  a  store-and- forward  message- switching  system,  buffer  storage 
should  be  provided  at  the  nodes.  Even  though  buffer  cost  will  decrease 
significantly  in  the  future,  buffer  size  is  nevertheless  an  important 
item  of  the  overall  network  cost.  The  purpose  of  this  section  is  to 
calculate  storage  requirements  for  the  cut-through  system  and  compare 
it  with  the  requirements  of  the  message-switching  system.  Through¬ 
out  this  section  we  will  be  dealing  with  noiseless  channels.  We  start 
with  some  assumptions  regarding  buffer  allocation  in  the  two  systems. 

2.6.1  Buffer  Allocation  in  Message  Switching 

Storage  is  divided  into  buffers  of  equal  size  which  can  accomo¬ 
date  a  message  of  maximum  length.  Each  buffer  can  be  used  by  only  one 
message  at  a  time,  even  though  it  does  not  fill  the  entire  buffer. 

Within  a  node,  a  message  is  never  allocated  more  than  one  buffer.  With 
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this  buffer  management,  one  might  think  that  the  assumption  of  exponen¬ 
tially  distributed  message  length  should  be  replaced  with  a  truncated 
exponential  distribution.  However,  measurements  on  existing  networks, 
especially  on  the  ARPANET  [KLE1  74B],  show  that  the  average  message 
length  is  relatively  small  (250  bits  in  the  ARPANET).  With  a  reasonably 
large  buffer  size,  and  considering  the  fact  that  the  average  message 
length  is  much  smaller  than  the  buffer  size,  and  that  large  messages 
occur  very  infrequently,  we  are  justified  in  keeping  our  exponential 
message  length  assumption.  Regarding  the  messages  which  are  being  trans¬ 
mitted,  we  assume  that  when  the  first  bit  of  a  message  arrives  in  a  node, 
a  full  buffer  is  allocated  to  the  message.  This  means  that  all  messages 
in  transmission  use  two  buffers  simultaneously.  We  also  assume  that 
acknowledgement  is  instantaneous.  This  means  at  the  moment  a  message  is 
completely  received  at  a  node,  its  source  buffer  is  relinquished. 

2.6.2  Buffer  Allocation  in  Cut-Through  Switching 

There  are  two  types  of  buffers  in  this  system,  message  buffers 
and  header  buffers  (with  lengths  8.^  and  8.^) .  A  header  buffer  is  used  by 

a  message  at  a  node  which  it  is  cutting  through;  it  is  allocated  when 
the  first  bit  of  a  message  arrives  at  a  node.  After  the  reception  of 
the  header,  if  it  is  recognized  that  the  selected  outgoing  channel  is 
free,  no  further  storage  is  allocated  to  this  message  and  it  starts  being 
transmitted  out.  If,  however,  the  outgoing  channel  is  busy,  then  the 
message  must  be  completely  received  in  the  node  and  (the  remainder  of)  a 
message  buffer  is  allocated  to  it.  (This  happens  at  a  final  node,  i.e., 
the  destination  node  or  any  intermediate  node  where  a  cut  is  not 
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possible.)  Ne  notice  that  a  message  may  be  using  a  number  of  header 
buffers  simultaneously,  but  it  can  use  at  most  two  message  buffers  at 
one  time.  Regarding  the  acknowledgement,  we  assume  that  a  copy  of  a  mes¬ 
sage  is  kept  at  the  source  node,  or  at  any  intermediate  node  in  which  it 
is  blocked  (i.e.,  can  not  make  a  cut)  until  it  is  completely  received  at 
a  succeeding  final  node.  If  this  final  node  is  the  destination  node 
then,  after  completion  of  the  transmission,  both  message  buffers  are 
released  simultaneously.  If,  however,  this  final  node  is  not  the  desti¬ 
nation  node  then  a  copy  of  the  message  is  kept  in  this  final  node  and 
the  other  message  buffer  is  released.  Notice  that  after  the  reception 
of  the  first  acknowledgement  the  copy  of  the  message  at  the  source  node 
is  dropped  and  its  buffer  is  released. 

Let 

E [total  amount  of  storage  required  at  a  path  in  message  switching] 
=  E [total  amount  of  storage  required  at  a  node  in  message  switching] 

and  let  and  Sc  be  the  same  quantities  for  the  cut-through  system. 

As  before,  we  use  £  and  £,  for  the  message  buffer  and  header  buffer 
m  n 

size.  We  also  assume  that 

a  =  £^/£m  =  (avg.  header  length)/ (avg.  msg.  length) 

2.6.3  Storage  Requirement  for  Message  Switching 

At  any  instant  a  message  which  is  being  transmitted  occupies 
two  buffers.  The  other  messages  occupy  only  one  buffer,  so  we  have 

s(p)  =  |n(p)  +  n.  (N  .  p)}£  (2.20) 

m  lm  h  ch  J  m 
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where 


N^=  E[number  of  messages  on  a  path  for  message  switchingl  **  T  A 
m  m 

and  (Nc^p)is  the  average  number  of  messages  which  are  being  transmitted 

at  each  node.  From  Eq.  (2.20)  we  get 

s(p) 

s  =  — — 

m  — 
nh 

(2.21) 

Sm  =  (N  *  N  pH 
m  m  ch  m 

where  is  the  average  number  of  messages  at  each  node  which  are  on  the 
same  path  that  we  are  considering. 

2.6.4  Storage  Requirement  for  Cut-Through  Switching 
Let 

6  =  ceiling{l/a} 

=  ceiling{(msg  length) /(header  length)} 

(ceiling{x)  =  smallest  integer  larger  than  or  equal  to  x) .  Consider  a 
message  which  is  being  transmitted  along  a  path.  Every  time  that  the 
message  makes  a  cut  through  a  node,  it  is  allocated  a  header  buffer, 
i.e.,  the  total  amount  of  storage  allocated  to  it  is  increased  by  one 
header  buffer.  This  process  continues  until  the  number  of  the  consecu¬ 
tive  cuts  exceeds  6,  after  which,  for  each  new  header  buffer  that  the 
message  is  allocated,  it  releases  one.  Fig.  2.12-a  shows  this  phenomenon. 
The  solid  lines  show  the  amount  of  storage  allocated  to  the  message 
while  it  is  being  transmitted  along  the  path.  Fig.  2.12-b.  At  time  t^ 
the  message  starts  transmission  from  node  1,  the  source  node;  because 
we  have  assumed  propagation  delay  is  negligible,  at  the  same  time  a 
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header  buffer  is  allocated  to  the  message  at  node  2  (because  it  can  make 


a  cut  through  this  node) .  After  t,  seconds  the  header  is  received  com- 

n 

pletely  at  node  2,  at  which  time  it  starts  transmission  towards  node  3. 

By  assumption,  because  the  message  can  make  a  cut  through  node  3,  another 

header  buffer  is  allocated  to  it  instantaneously.  This  process  continues 

and  the  amount  of  the  allocated  storage  increases  up  to  node  5;  however, 

from  node  6  on  (at  time  t^),  the  tail  of  the  message  starts  leaving  nodes 

2,  3,  ....  and  for  each  new  header  buffer  it  occupies,  it  releases  one; 

hence  the  total  amount  of  storage  allocated  remains  unchanged.  At  time 

t4  the  message  arrives  in  node  8  and  a  header  buffer  is  allocated  to  it 

(so,  there  is  no  change  in  the  total  amount  of  buffers  allocated  to  this 

message  on  the  path).  But,  after  t,  seconds  (at  time  tc)  when  the  header 

n  i> 

is  received,  it  is  recognized  that  the  selected  outgoing  link  is  busy. 

So,  the  message  is  blocked  and  (the  remainder  of)  a  message  buffer  is 
allocated  to  it.  As  the  tail  of  the  message  leaves  the  intermediate 
nodes,  header  buffers  are  released.  When  the  message  is  received  in  full 
at  node  8  (at  time  t^),  the  copy  of  the  message  present  at  node  1  (the 
source  node)  is  dropped  and  its  buffer  is  released.  The  right  part  of 
Fig.  2.12-a  shows  the  situation  where  the  number  of  consecutive  cuts 
is  less  than  6. 

We  define  the  storage  occupancy  function  s(t)  as  follows: 
s(t)  =  amount  of  storage  allocated  to  a  message  at  time  t. 

If  we  approximate  the  step  function  s(t),  drawn  in  solid  lines 
in  Fig.  2.12-a,  by  the  broken  lines  in  the  same  figure,  we  can  observe 
a  very  useful  property.  The  shaded  areas  in  this  figure  are  equal.  The 
interesting  fact  is  that  when  the  number  of  consecutive  cuts  is  larger 
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than  6,  there  is  no  approximation  involved  (left  part  of  Fig.  2.12-a). 

We  use  this  property  in  calculating  .  To  do  so,  we  introduce  addi¬ 

tional  notations  g.  and  w.  as  follows: 

g^  =  length  of  the  i**1  transmission  period  of  a  message, 
w^  =  length  of  the  i**1  waiting  period  of  a  message. 


Using  the  above  observation,  we  have  the  following  identities: 


/ 


t~  +g . 
g.  5i 

1  s(t)  dt  =  2g  £ 
l  m 


t~ 


/ 


t~  +W  . 
W.  1 
1 


s(t)  dt 


w.  £ 
l  m 


t~ 

w. 

l 


where  t-.  is  the  time  that  the  i**1  transmission  starts  and  t~  is  the 


g 


.th 


w. 

l 


time  that  the  i  waiting  period  starts. 

Let  Wc  be  the  expected  total  amount  of  time  that  a  packet 
spends  waiting  and  Tc>  as  defined  before,  be  the  expected  total  network 
delay  (waiting  time  plus  transmission  time).  We  have  the  following 
relationships : 


and 


»c  ■  E[?  Si] 

1  J 

tc-e[E  (Bi-Si)] 
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Average  Storage  Requirement  of  one  message 

Efe  C2*.l  .S.lj] 


which  can  be  reduced  to 


There  are  AT 


where  T,  is  given  by  Eq.  (2.7).  Our  goal  is  to  find  the  quantity 


E [total  time  that  any  part  of  a 
message  is  being  transmitted 
on  any  channel] 


E  [number  of  times  that  a  packet  does  not 
enter  a  free  node] 


Proposition  2.1 


The  value  of  T  is  given  by 


Proof 


Each  time  that  the  message  makes  a  cut,  it  spends  t^  seconds 
for  transmission  (rather  than  tQ  seconds).  If  the  message  can  cut 
through  all  of  the  intermediate  nodes,  then  its  total  transmission  time 
will  be 

ro  +  (V1}th 

However,  at  each  node  where  the  message  is  blocked,  instead  of  t^,  t^ 
seconds  are  spent  for  transmission.  On  the  average  n^  times  a  packet  is 
blocked,  causing  (tQ  -  t^Jn^  secont^s  extra  transmission  time.  Adding 
the  two  terms,  we  get  Eq.  (2.23). 

Q.E.D. 

After  some  algebra,  and  recalling  that  a  =  t^/t0,  we  reduce 
Eq.  (2.23)  to 

TQ  =  [1  ♦  (nh-  l)a+  (1  -  a)(nh-  l)Pw]tQ  (2.24) 

for  the  special  cases  we  have 

Tn  =  n.  t.  when  P  =  1  (2.25) 

0  h  0  w 

T0  =  "0  +  ("h-1}th  whenPw=0  (2.26) 

In  Eq.  (2.25)  Tq  is  the  total  transmission  time  when  no  cut  is  possible 
(i.e.,  in  message  switching),  while  in  Eq.  (2.26)  is  the  transmission 

time  when  all  of  the  intermediate  nodes  are  cut  through. 

Combining  Eqs.  (2.2 2)  and  (2.24)  and  replacing  Tc  by  its  value 
•  *  m  I  m  l  2. 7) ,  we  get 

■  •  K".  *  V><"h  -  <2-27> 
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where  N^  is  the  average  number  of  messages  at  a  node  in  a  message  switch¬ 
ing  system.  For  the  average  buffer  requirement  at  each  node  we  have  the 
following  expression 


S  =  S(p)/n. 
c  c  h 


=  K  *  V’f1  -  2  <1-00(1 


-  '.J ! 


/  l 

)  m 


(2.28) 


Note  that  if  P  =1,  then  =  S^, 

w  c  m 


To  compare  storage  requirements  of  the  two  switching  methods 

we  study  the  behavior  of  the  ratio  6  =  S  /S 

c  m 


6  = 


lV*Nehp[I  '  2-hT'(1-“)(I-Pw)] 


nh 

N  +  N  ,  p 
m  civ 


(2.29) 


From  Eq.  (2.29)  it  is  clear  that  S  <  S  ,  and  equality  holds 

c  —  m 

only  when  P  =  1  or  a  =  1.  This  shows  that  the  performance  of  the  cut- 
through  system,  in  terms  of  storage  requirement,  is  always  better  than 
the  message-switching  system. 

The  ratio  0  is  a  decreasing  function  of  N  ,  ,  the  number  of 

ch 

channels.  For  t  °°  we  have  the  limiting  value  of  9  given  below: 


n,  -  1  1  +  (n,  -  l)a 

lim(e)  =  1  -  -2- -  (l-o)  =  - - - 

Nch+~  n. 


(2.30) 


n. 


h  “h 

which  is  independent  of  the  utilization  factor  p.  Fig.  2.13  shows  the 
behavior  of  9  as  a  function  of  N  ^  for  different  values  of  p.  Notice 
that  the  true  limiting  value  should  be 

2  +  (nh-  l)ct  1  +  0.5(nh-  l)a 
2n.  n. 
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irition  of  Storage  Requirement  in  the  Cut-Throu(0> 
i  and  the  Menage  Switching  System. 


which  can  be  found  by  considering  the  fact  that  in  the  limit  (Nch  +  ao) 
there  is  no  waiting  message  in  the  system  (P  ■  0).  In  the  case  of  cut- 
through  switching,  all  of  the  messages  are  cutting  through;  and  in  the 
case  of  message  switching,  all  of  the  messages  in  the  system  are  in 
transmission.  The  discrepancy  is  the  result  of  our  approximation  in 
calculating  (see  Figs.  2.12-a  and  b) . 

The  limiting  value  found  by  Eq.  (2.30)  is  0.24,  which  should  be 
compared  with  the  true  value  of  0.22. 

2.7  A  Design  Problem 

So  far  we  have  been  dealing  with  the  analysis  of  the  cut-through 
switching  technique.  Our  study  has  shown  that  this  technique  is,  in 
fact,  a  satisfactory  solution  to  the  switching  problem  in  communication 
nets.  It  takes  advantage  of  the  good  properties  of  message  switching 
by  simultaneously  using  multiple  links  along  a  path.  It  exhibits  nice 
properties  of  circuit  switching  by  not  incurring  unnecessary  inter¬ 
mediate  nodal  delay  for  unnecessary  store-and-forward. 

In  this  section  we  look  at  a  design  problem  for  this  switching 
technique.  Several  other  formulations  of  design  problems  for  packet¬ 
switching  systems  can  be  found  in  the  literature  (e.g.,  [KLEI  64], 

[GERL  73A]  and  [KAMO  76B];  along  with  these  formulations,  solutions  have 
been  proposed. 

The  selection  of  appropriate  design  criteria  for  communication 
nets,  by  its  nature,  is  an  involved  task.  The  difficulty  stems  from  the 
multiplicity  of  parameters,  hence  numerous  choices,  at  the  disposal  of  a 
network  designer.  It  is  not  our  intention  to  deal  with  the  design 


67 


problem  thoroughly  since  the  literature  is  already  rich  with  solutions 
for  different  versions  of  this  problem.  We  assume  that  certain  param¬ 
eters,  which  are  usually  considered  as  design  parameters,  are  selected. 

In  particular,  we  assume  that  topology,  channel  capacity  and  routing 
matrix  are  already  known  to  us.  The  basic  design  variable  for  us  will 
be  the  number  of  channels  per  link.  In  Fig.  2.7  we  have  already  shown 
the  effect  of  this  parameter  on  the  network  delay  (which  so  far  has  been 
the  main  measure  of  performance  for  us) .  The  significance  of  the  number 
of  channels  becomes  clearer  when  we  talk  about  our  other  performance 
criteria. 

For  performance  we  consider  three  measures:  network  delay, 
throughput  and  the  probability  that  a  message  can  cut  through  all  of  the 
intermediate  nodes  along  its  path.  The  significance  of  the  first  two 
measures  is  clear;  however,  since  the  last  criterion  is  somehow  unusual, 
we  choose  to  discuss  it  further. 

We  have  noticed  that  in  some  situations  the  cut-through  system 
resembles  a  circuit-switched  system.  This  specific  property  is  desirable 
for  certain  applications,  for  example,  real-time  systems  or  those  appli¬ 
cations  which  require  permanent  "physical"  connections  between  remote 
stations.  If  a  message  can  cut  through  all  of  the  intermediate  nodes  on 
a  path,  then  we  can  logically  assume  that  there  is  a  "physical"  circuit 
between  its  source  and  destination  nodes.  This  is  the  reason  for  the 
significance  of  the  last  measure.  The  probability  of  this  event  happen¬ 
ing  is  easy  to  find.  In  fact  we  have 
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Given:  Network  topology,  channel  capacity,  routing  matrix 
Maximi ze :  Throughput 

with  respect  to  the  number  of  channels  per  link 

A 

Subject  to:  T  <  T  Delay  constraint  (T  =  T  yC) 

C  mclX  c  c 

P  Blocking  probability  constraint. 


Notes: 

(1)  We  have  assumed  that  by  specifying  the  topology  we  also  specify 
the  path  length  under  consideration,  hence  we  can  use  Eq.  (2.31) 
for  P  The  average  probability  of  no  blocking  for  the  entire 
network  can  only  be  specified  when  we  know  the  exact  topology 
and  routing  matrix  (a  simple  calculation  shows  that  for  the 
entire  network,  P  ^  =  N[(l  -  Pw)]/(1  -  Pw) ,  N[»]  being  the  gen¬ 
erating  function  of  path  lengths). 

(2)  As  pointed  out  in  previous  sections,  we  assume  noiseless  chan¬ 
nels,  hence  we  can  use  Eq.  (2.7)  to  find  f£. 

(3)  The  above  formulation  is  by  no  means  a  unique  formulation.  In 
fact  any  one  of  the  performance  measures  (p,  or  P  ^)  can  be 
selected  as  the  optimizing  measure  and  the  other  two  as 
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constraints;  however  in  all  of  these  cases,  the  results  are 
essentially  the  same. 


The  design  problem  stated  above  can  be  solved  by  mathematical 

programming.  However,  reflection  on  the  expression  for  P  ,  Eq.  (B.3), 

shows  that  all  of  the  performance  measures  are  complicated  functions  of 

the  number  of  channels,  N  .  .  For  this  reason  an  exact  solution  is,  if 

ch 

not  impossible,  extremely  difficult  to  find;  hence  we  will  present  a 
heuristic  solution  to  this  problem. 

It  is  clear  that  for  a  given  throughput,  a  smaller  number  of 
channels  per  link  (N^)  results  in  a  lower  delay;  however,  it  also 
results  in  a  lower  P  ^  (Figs.  2.14-a  and  2.14-b).  Therefore  the  number 
of  channels  has  opposing  effects  on  these  two  measures.  The  following 
algorithm^tries  to  find  the  smallest  number  of  channels  per  link  which 
satisfies  both  constraints. 


If  P  ,  (m,  oi)  <  e  go  to  step  4 
nb 

If  P  ,  (m.  Pi)  >  e  go  to  step  5 
nb 

If  P  .(m,  pi)  =  e  set  p=Pi,  go  to  step  6 
nb 

4  ‘  (Pnb^m*  Pl)  <  E) 

Find  p2  such  that  pnl3(m»  P2)  55  £.  Set  p=P2*  go  to  step  6. 
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5  ■  (*W"'  Pl)  >  e  > 

If  m  =  1  set  p  =  pi f  go  to  step  6. 

If  m  >  1  set  m  *  i-l,  go  to  step  2. 

6  -  Set  N  ^am.  p  has  already  been  determined.  From  p  =  A/pC 

we  can  find  either  A  or  uC  by  knowing  the  other. 

Discussion: 

A 

We  have  used  Tc(m,  p)  to  indicate  the  normalized  network  delay 
when  the  number  of  channels  is  m  and  the  utilization  factor  is  p.  The 
same  notation  holds  for  P^tm.  P)  • 

In  step  1,  m  is  the  largest  number  of  channels  which  may  be 
used.  Any  number  greater  than  this  will  violate  the  delay  constraint. 
Pi  in  step  2  is  the  maximum  utilization  achievable  when  the  number  of 
channels  is  m.  In  step  4,  pi  is  not  achievable;  however,  m  is  the  best 
choice.  On  the  other  hand.  Pi  in  step  5  is  achievable  but  m  may  not  be 
optimal  (i.e.,  it  may  be  possible  to  use  a  smaller  m). 

Algorithm  2.1  is  still  valid  for  the  case  when  we  have  some 
restrictions  on  the  choice  of  the  number  of  channels,  e.g.,  when  the 


number  of  channels  should  be  chosen  from  a  finite  set  of  numbers. 


^ch  ’  ^ch  *  •••»  Nch  ^  that  t*ie  case*  only  steps  1  and  5  have 
to  be  changed  such  that  the  next  smaller  channel  number  is  selected. 


Consider  the  following  helpful  example.  For  a  given  network  we 
have  n^=  3,  Pg =  0  and  a=  0.  We  also  have  the  following  choices  for  the 
number  of  channels  per  link:  1,  2,  3,  4,  8,  10,  15  and  20.  The  con¬ 
straints  are  T£  <  10.5  (normalized  delay)  and  P  ^  >  0.85.  Figs.  2.14-a 
and  2.14-b  show  the  delay  and  P  ^  curves  for  this  network.  The  curves 
are  plotted  for  different  numbers  of  channels  per  link.  We  carry  out 
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"BET 


PROBABILITY  OF  NO  BLOCKING  (P 


algorithm  2.1  as  follows. 

From  step  1  we  recognize  that  m=  10  is  the  maximum  number  of 
channels  which  we  can  use.  Step  2  leads  us  to  point  a  in  Fig.  2.14-a 
with  utilization  factor  0.44.  For  this  network  with  N  ^  =  10  and  p=0.44, 
will  be  0.96  (point  a  in  Fig.  2.14-b).  We  now  observe  that  we  can 
improve  the  performance  because  10  is  too  large  for  the  number  of  chan¬ 
nels.  The  next  smaller  choice  is  1^=8.  Again,  from  step  2  we  are  led 
to  point  b  in  Fig.  2.14-a  with  utilization  factor  0.57.  However,  for 
=  8  and  p=0.57  we  have  Pn^=0.8  (point  b  in  Fig.  14-b)  which  violates 
the  second  constraint.  By  Algorithm  2.1  (step  4)  we  know  that  is  the 
best  choice.  The  achievable  throughput  is  found  from  Fig.  2.14-b  (point 
c  on  this  figure).  This  value  is  p  =  0.52  and  the  corresponding  delay 
is  9. 

2.8  Conclusion 

In  this  chapter  we  proposed  and  analyzed  a  new  switching  system 
named  "virtual  cut-through."  In  this  switching  system  a  message  is  buf¬ 
fered  in  an  intermediate  node  only  if  necessary  (due  to  busy  channels) , 
hence  the  delay  due  to  unnecessary  buffering  in  front  of  an  idle  channel 
is  avoided.  If  a  message  cuts  through  all  of  the  intermediate  nodes, 
the  system  becomes  similar  to  circuit  switching.  On  the  other  hand,  if 
a  message  encounters  busy  channels  at  all  of  the  intermediate  nodes,  the 
outcome  is  the  same  as  for  message  switching.  Therefore,  cut-through 
switching  is  essentially  a  hybrid  mixture  of  circuit-switching  and  mes¬ 
sage-switching  techniques.  It  takes  advantage  of  the  good  properties  of 
message  switching  by  simultaneously  using  multiple  links  along  a  path; 
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meanwhile,  it  exhibits  desirable  properties  of  circuit  switching,  by  not 
incurring  unnecessary  intermediate  node  delay  for  store-and-forward. 

Our  analysis  showed  that  cut-through  switching  is  superior  to 
message  switching  from  at  least  three  points  of  view:  network  delay, 
traffic  gain  and  buffer  storage  requirement.  In  general,  at  the  same 
traffic  level,  the  network  delay  in  the  cut-through  switching  system  is 
less  than  in  message  switching.  Our  analysis  in  Section  2.3  showed  that 
only  when  the  channel  error  rate  is  too  high  may  the  message  switching 
delay  become  less  than  the  cut-through  system  delay.  In  Section  2.4  we 
showed  that  at  the  same  network  delay,  cut-through  switching  can  carry 
more  throughput  than  message  switching,  and  in  Section  2.6  we  demon¬ 
strated  that  the  storage  requirement  in  cut-through  switching  is  less 
than  in  message  switching.  When  a  message  can  cut  through  all  of  the 
intermediate  nodes,  the  network  path  becomes  similar  to  a  physical  con¬ 
nection  between  two  remote  stations.  This  property  is  desirable  for 
certain  applications.  In  Section  2.7  we  developed  an  algorithm  which 
can  maximize  the  throughput  while  guaranteeing  certain  probability  that 
messages  can  cut  through  all  of  the  intermediate  nodes  on  a  path. 

Our  study  in  this  chapter  showed  that  cut-through  switching  is 
a  good  solution  to  the  switching  problem  in  a  communication  network.  In 
Chapter  3  we  will  show  that  in  many  cases  the  performance  of  cut -through 
switching  is  even  better  than  that  of  circuit  switching. 

We  did  not  study  the  physical  implementation  of  this  system, 
and  we  also  did  not  discuss  the  protocol  design  for  this  switching 
technique.  These  areas  require  further  research. 
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CHAPTER  3 


A  TRADEOFF  STUDY  OF  SWITCHING  SYSTEMS 


The  previous  chapter  demonstrated  that  depending  on  the  traffic 
of  a  data  communication  network,  considerable  delay  and  storage  reduc¬ 
tion  can  be  achieved  by  using  the  virtual  cut-through  switching  (CTS) 
technique.  Throughout  that  chapter  we  compared  performance  of  CTS  with 
message  switching  (MS)  and  our  study  showed  that  in  terms  of  network 
delay,  CTS  never  performs  worse  than  MS.  In  this  chapter  we  are  inter¬ 
ested  in  extending  the  scope  of  our  comparison  to  include  circuit 
switching  (CS) . 

A  tradeoff  study  of  switching  systems  involves  consideration 
of  a  wide  range  of  issues;  however,  in  most  cases  cost  and  delay  are 
by  far  the  two  major  criteria.  Network  cost  consists  of  installation 
and  maintenance  cost  (from  the  owner's  point  of  view)  and  service  cost 
(from  the  users'  point  of  view).  Consideration  of  cost  requires  either 
specializing  the  study  toward  a  specific  system  or  strong  assumptions 
regarding  the  physical  structure  and  hardware  costs  of  the  switching 
nodes  and  the  communication  media.  With  the  rapid  advances  of  today's 
technology,  with  a  trend  toward  lower  cost  and  better  performance,  any 
assumption  on  (even  relative)  cost  of  hardware  may  soon  become  obsolete. 
For  the  above  reasons  we  choose  not  to  base  our  comparison  study  on  cost 
and  consider  the  network  delay  as  the  only  comparison  criterion. 

In  the  last  chapter,  analytic  models  were  developed  to  predict 
the  throughput-delay  profile  of  MS  and  CTS.  In  this  chapter  we  first 


formulate  a  network  model  for  delay  in  a  circuit-switched  system;  how¬ 
ever,  no  simple  solution  to  the  transform  equation  which  results  from 
this  model  has  been  found.  Next,  we  generalize  the  independence  assump¬ 
tion  [KLEI  64]  and  develop  an  analytic  model  for  the  delay  of  a  communi¬ 
cation  path  between  two  users  in  a  CS  network.  A  somehow  similar  model 
was  formulated  by  Port,  et  al.  [PORT  71].  Although  the  model  is  based 
on  a  weakened  assumption,  it  shows  rather  interesting  behavior  for  CS. 
Having  thus  developed  expressions  for  delay  in  CS,  we  compare  the  per¬ 
formance  of  the  three  switching  systems. 

Before  giving  the  analysis,  we  review  the  literature. 

3. 1  History  and  Related  Work 

With  the  birth  of  the  first  practical  packet-switching  network, 
the  ARPANET  [HEAR  70],  [KLEI  70],  [FRAN  70],  [CARR  70]  and  [ROBE  70] 
in  1969,  we  witnessed  an  explosion  of  activities  in  the  field  of  computer 
and  data  communication  networks  [KLEI  76B ] -  Numerous  symposia  and  con¬ 
ferences  have  been  devoted  to  this  field  and  the  interest  is  still  grow¬ 
ing.  In  fact,  packet  switching  was  an  answer  to  many  unsolved  problems 
of  data  communications  and  it  is  not  surprising  to  have  witnessed  an 
over-zealous  enthusiastic  claim  in  its  favor  during  the  early  years 
after  its  birth.  However,  there  was  already  a  large  investment  in  tele¬ 
phone  networks,  a  technology  based  on  circuit  switching.  After  a  period 
of  silence,  those  in  favor  of  circuit  switching  started  challenging  the 
efficiency  of  packet  switching.  The  main  argument  in  favor  of  CS  was 
based  on  the  intuitive  fact  that  if  there  is  a  need  for  transmitting  a 
long,  continuous  stream  of  data,  then  a  leased  line  (or  circuit  switch) 


communication  makes  good  sense.  Circuit-switched  and  telephone  networks 
were  well-studied  systems  and  there  was  an  enormous  body  of  literature 
available  for  different  aspects  of  their  operations  ([WILK  56],  [SYSK  60] 
and  the  references  therein);  however,  in  all  studies  these  systems  are 
viewed  as  loss  systems.  This  means,  once  a  request  for  service  (in  this 
case  a  call)  finds  no  available  trunk  channel,  it  is  dropped  from  the 
system  and  is  lost.  However,  a  computer  communication  network  is  basi¬ 
cally  a  delay  system  in  which  the  two  parties  of  a  communication  can 
wait,  if  necessary,  for  a  connection  to  be  established  between  them. 

In  order  to  compare  the  performance  of  the  two  switching  techniques  on 
a  concrete  basis,  there  was  a  need  to  develop  a  delay  model  for  CS. 

This  was  done  by  devising  a  signalling  mechanism  for  CS,  operating  on  a 
delay  basis.  Based  on  this  model,  some  studies  have  been  conducted  in 
an  attempt  to  compare  these  switching  techniques  [PORT  71],  [CLOS  72A,B], 
[CLOW  73],  [ITOH  73],  [MIYA  75],  [ROSN  76]  and  [KUMM  76A,B].  In 
[PORT  71]  it  was  shown  that  the  "signalling  time"  (which  is  different 
from  total  network  delay)  in  a  CS  network  for  all  the  required  tandem 
circuits  on  a  path  to  simultaneously  become  free  is  often  shorter  than 
the  waiting  time  in  a  series  of  high-speed  tandem  channels  in  a  message- 
switched  network.  This  paper  was  devoted  to  the  issue  of  signalling 
delay  and  was  one  of  the  earliest  attempts  to  challenge  the  superiority 
of  packet  switching  on  a  quantitative  basis.  Later  reports  [CLOS  72A,B] 
are  extensions  of  this  work.  In  [ITOH  73]  a  detailed  comparison  based 
on  the  network  delay  between  the  switching  schemes  is  made.  This  work 
confirmed  the  intuitive  understanding  that  at  higher  traffic  rates  store- 
and-forward  switching  results  in  a  better  delay  performance,  whereas  for 
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longer  message  lengths,  CS  is  superior  to  the  other  switching  technique. 
In  [KUM  76A,B]  a  comprehensive  comparison  study  was  done  from  two  points 
of  view:  delay  performance  and  usage  cost.  This  study  again  confirmed 
the  previous  results. 

In  passing,  we  should  point  out  that  almost  all  of  the  previous 
works  (except  [PORT  71]  have  neglected  to  include  the  effect  of  channel 
reservation  in  their  analytical  modeling  of  CS.  The  fact  is,  channel 
reservation  degrades  the  performance  of  CS  severely  in  the  sense  that 
networks  under  CS  operation  saturate  very  fast.  As  we  shall  see  shortly, 
an  exact  analysis  of  this  phenomena  is  extremely  difficult;  however,  by 
accepting  some  simplifying  assumptions  (which  were  first  introduced  in 
[P0RT71])  we  are  able  to  develop  a  more  accurate  analytic  model  than  has 
been  used  in  the  studies  so  far. 

The  aim  of  the  present  study  is  to  give  a  reasonably  realistic 
and  quantitative  performance  of  the  three  switching  systems,  CTS,  MS 
and  CS.  As  we  did  in  the  last  chapter,  we  do  not  differentiate  between 
packet  switching  and  message  switching;  we  consider  both  methods  as  mem¬ 
bers  of  the  larger  class  of  store -and -forward  switching  systems. 

3.2  A  Delay  Model  for  Circuit  Switching 

We  use  the  following  model  for  a  circuit-switching  system  which 
operates  as  a  "delay"  system.  This  model  is  similar  to  the  models  used 
in  the  work  referred  to  in  the  previous  section. 

In  this  model  a  message  waits  (rather  than  being  lost)  at  the 
source  node  and  sends  a  signal  toward  its  destination  (request-for- 
connection).  While  travelling  node  by  node  towards  the  destination  node. 
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the  signal  reserves  channels  along  the  path.  If  at  any  intermediate 
node  it  cannot  find  a  free  channel,  while  holding  the  channels  it  has 
reserved  so  far,  it  waits  for  a  channel  to  become  free,  at  which  time 
the  signal  reserves  it  and  goes  to  the  next  node  to  repeat  the  same 
process.  By  the  time  the  signal  reaches  the  destination  node,  a  path 
has  been  reserved  between  the  source  and  the  destination  nodes.  Figure 
3.1  shows  the  structure  of  a  node  in  a  communication  network  (Fig.  3.2) 
in  which  the  nodes  are  connected  via  two  sets  of  channels,  message  chan¬ 
nels  and  signalling  channels.  A  message  from  outside  the  network  arrives 
in  box  (or  queue)  A  and  sends  a  (request-for-connection)  signal  to  estab¬ 
lish  a  path  between  this  node  (S)  and  its  destination  node  (0).  Box  B 
is  a  queue  in  which  signals  wait  to  reserve  one  of  the  message  channels 
between  nodes  S  and  j  (service  facility  C) .  Having  reserved  a  channel, 
the  signal  joins  queue  D,  the  signalling  queue,  and  waits  for  transmis¬ 
sion  over  the  signalling  channels  E  to  join  node  j.  At  node  j  the  signal 
goes  through  the  same  process  until  it  arrives  at  the  destination  node  D. 
When  the  signal  reaches  its  destination,  its  originating  message  is 
notified  (through  a  reverse  signalling  process  called  request-for- 
transmission) ,  at  which  time  the  message  can  start  transmission.  Note 
that  (only)  during  the  message  transmission  are  all  of  the  channels  on 
the  path  used  simultaneously,  hence  this  last  transmission  is  similar  to 
a  one-hop  transmission.  After  the  message  transmission  is  completed, 
the  reserved  channels  are  released.  We  assume  that  no  further  signalling 
is  required  for  releasing  the  reserved  channels. 

The  above  model  is  usually  referred  to  as  a  "forward  reservation- 
common  signalling  channel."  Reservation  of  message  channels  can  be  done 
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while  the  signal  is  coming  back  from  the  destination  node  (backward 
reservation).  Signalling  channels  can  also  be  integrated  into  message 
channels,  which  is  called  "individual  signalling."  In  this  case,  when 
a  message  channel  is  reserved  by  a  signal,  the  signal  is  sent  through 
the  reserved  channel.  If  that  is  the  case,  then  queue  D  and  service 
facility  E  in  Fig.  3.1  lose  their  identities  and  are  integrated  in 
queue  B  and  service  facility  C,  respectively. 

3.3  An  Exact  Analysis 

In  this  section  we  analyze  the  "forward  reservation  -  common 
signalling  channel"  model  of  the  CS  system  of  the  last  section.  To 
simplify  the  analysis  we  disregard  the  backward  signalling  process,  i.e., 
we  assume  that  when  a  path  for  a  request  is  set  up,  the  requesting  mes¬ 
sage  starts  transmission  immediately. 

In  ordeT  to  find  the  network  delay,  we  must  first  calculate 
the  nodal  delay  (the  time  a  signal  spends  in  a  node  to  reserve  a  chan¬ 
nel.)  Once  this  latter  quantity  is  determined,  calculation  of  the 
network  delay  is  a  trivial  task.  As  we  will  see  shortly,  determination 
of  the  nodal  delay  is  a  nontrivial  task  and  throughout  this  section  we 
will  discuss  this  problem  exclusively. 

An  abstract  model  of  a  node  is  shown  in  Fig.  3.3.  The  waiting 
buffer  A  is  missing  in  this  picture,  as  it  is  not  needed  in  our  analysis. 
The  system  in  Fig.  3.3  consists  of  two  tandem  queues.  Stage  I  (waiting 
room  B  and  service  facility  C)  is  for  reservation  and  stage  II  (waiting 
room  D  and  server  E)  for  signalling.  Service  facility  C  is  a  fictitious 
server  in  the  sense  that  when  a  signal  reaches  the  head  of  queue  B  and 


STAGE  I 


STAGE  II 


B  -  WAITING  ROOM  TO  RESERVE  A  FREE  CHANNEL 
C  -  FICTITIOUS  SERVICE  FACILITY  (MESSAGE  CHANNELS) 
D  -  WAITING  ROOM  TO  TRANSMIT  SIGNALS 
E  -  SIGNAL  CHANNELS 


Figure  3.3  Abstract  Structure  of  •  Nod*. 
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reserves  a  channel,  it  is  no  longer  delayed  in  server  C  and  joins  queue 
D  directly.  However,  the  channel  reserved  by  this  signal  is  held  and 
cannot  be  used  by  others  while  it  is  establishing  the  rest  of  the  path 
and  during  the  transmission  of  its  originating  message.  So,  the  waiting 
time  Wj  and  the  total  system  time  s^  (waiting  plus  service)  of  this  sys¬ 
tem  are  equal,  i.e.,  w^  =  s^.  Stage  I  can  be  viewed  as  a  G/G/m  queueing 
system  in  which  the  service  distribution  is  the  entire  holding  time  of  a 
channel  (i.e.,  time  to  set  up  the  path  plus  time  for  transmission  of  the 
message) . 

In  order  to  find  the  nodal  delay,  we  must  calculate  the  queueing 
delay  for  the  reservation  of  a  channel.  The  queueing  system  G/G/m  of 
stage  I  has  not,  in  general,  been  analyzed  in  the  literature.  In  the 
following  we  outline  some  analyses  which  we  have  done.  We  start  by 
making  the  following  assumption. 

Assumption  3.1 

The  queueing  process  at  each  node  is  independent  of  the 
other  nodes  and  within  each  node  the  stochastic  processes  of 
stage  I  and  stage  II  are  independent  of  each  other  and  the 
network  is  balanced  (Definition  2.1). 

With  this  (compound)  assumption,  all  nodes  behave  identically 
and  independently. 

We  now  define  some  notation  in  addition  to  that  of  Chapter  2. 

In  the  following,  a  starred  function  denotes  the  Laplace  transform  of 
a  distribution  function.  Each  random  variable  is  related  to  the  Laplace 
transform  of  its  distribution  function  by  and  to  its  average  value 
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by  A  letter  with  a  bar  over  the  top  indicates  the  average  of  a 

random  variable. 


s*(.)  <=>  Sj 
W*(*)  <=>  *  <— ►  Wj 


system  time  at  stage  I  (reservation 
queue.  Fig.  3.3) 

waiting  time  at  stage  I 


s*(.)  <=>  s2  s2 

s*(.)  <-=>  s  <— >  s 

H*  ( •)  <->  h  <-*  h 


system  time  at  stage  II  (signal 
transmission  queue,  Fig.  3.3) 

total  nodal  delay.  Notice  s  sSj  +  i^, 
and  by  the  assumption  stated  above 
S*(s)  =  W}(s)S*(s) 

holding  time  of  a  channel 


>;<•) <_>  Eo  io 

B.(.)  <_>  th  <-  th 

N(z)  <=>  nh  •«— *  nh 


transmission  time  of  a  message  on  a 
message  channel 

transmission  time  of  a  signal  on  a 
signal  channel 

number  of  nodes  (or  hops)  on  a  path 


~r  -r 
%  ~nh 


remaining  number  of  nodes  on  a  path 
from  a  randomly  chosen  node. 


Consider  a  randomly  chosen  node  on  the  path  between  two  communicating 
source  and  destination  nodes  (say  node  j  on  the  path  between  nodes  S  and 
I)  in  Fig  3.2);  by  definition,  there  are  n^  nodes  remaining  on  the  path 
between  node  j  and  the  destination  node  D.  We  wish  to  study  the  holding 

time  of  the  channel  between  nodes  j  and  k  (the  next  node  on  the  path 

between  node  j  and  D) .  After  a  channel  is  reserved  between  nodes  j  and 

k  ,  the  signal  is  transmitted  to  node  k  (taking  $2  units  of  time). 


then  it  reserves  the  remaining  (n^  -  1)  channels  on  the  path,  at  which 
time  the  message  can  begin  transmission  (taking  units  of  time;  recall 
that  the  backward  signalling  is  disregarded).  The  channel  between  nodes 
j  and  k  is  in  use  during  all  of  the  above  process.  Based  on  what  we 
said,  we  have  the  following  expression  for  the  holding  time  h: 


h  =  s2  +  (n£-  l)s  *  tQ 


(3.1) 


Using  the  residual  life  argument  [KLEI  75  ],  we  find  the  distribution  of 
-r 

n.  as 


Pr[n^  =  n'l  =  £pr[n.=k]  (3.2) 

L  n  J  n.  fen  n 

n 


By  the  virtue  of  the  assumptions  stated  before,  we  can  find  the  condi¬ 
tional  Laplace  transform  for  the  distribution  of  h  as  follows: 


H*(s|nJ=n)  =  S*2(s)[S*(s)]“"aB*(s) 


n-1. 


(3.3) 


Using  Eq.  (3.2)  to  remove  the  condition  on  n£,  we  have 


»l*(s)  =  £  H*(s  |  n£=  n)Pr[n£=  n] 
n>l 

=  E  S*(s)(S*(s)Jn_1B*(s)  i-  £  Pr [n.  =  k] 
nsi  £  nh  k*n 

and  after  some  algebra  we  get 


H*(s) 


Bm(s)S2(s) 

nh[l-S*(s)] 


(1 


NLS*(s)]} 


B*(s)S*(s) 

- - 2 - - -  {1  -  N[S*(s)5*(s)]}  (3.4) 

n  [l-S*(s)S*(s)] 
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We  can  now  find  the  average  holding  time  as  follows 


li 


s=0 


or 


(3.5) 


where  n^  is  the  second  moment  of  the  number  of  hops  on  a  path.  For  the 
special  case  where  all  of  the  messages  go  through  the  same  number  of 
hops,  Eq.  (3.5)  is  reduced  to 


(3.6) 


However,  s  =  s^  +  S2  =  +  s2*  50  we  have 


h  = 


nh  ♦  1 

t0  +  2  s2 


(3.7) 


Eqs.  (3.4)  through  (3.7)  give  us  relationships  in  the  G/G/m  queueing 
system  of  the  reservation  operation. 

In  Eq.  (3.4),  B*(*)  is  known  and  N(*)  can  be  calculated  from 
the  routing  matrix  (if  the  routing  is  deterministic)  and  the  topology 
of  the  network;  morever,  S*(s)  can  be  found  by  a  reasonable  assumption 
for  the  operation  of  the  signalling  channels.  Hence  Eq.  (3.4)  gives  us 
a  relationship  between  the  service  time  (or  the  holding  time)  h  and  the 
waiting  time  w^  of  the  G/G/m  queueing  system  under  study.  When  the  num¬ 
ber  of  hops  for  all  paths  is  the  same,  Eq.  (3.7)  gives  a  relationship 
between  the  average  values  of  these  quantities.  In  order  to  find  h  and 
w^  explicitly,  we  need  another  relationship  between  these  two  variables. 
So  far,  we  have  not  been  successful  in  finding  such  a  relationship. 
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To  verify  the  accuracy  of  Eq.  (3.7),  we  simulated  the  system. 

The  topology  of  the  simulated  network  is  shown  in  Fig.  3.4.  This  is 
essentially  similar  to  the  topology  which  we  used  for  the  simulation  of 
Chapter  2  (Fig.  2.6),  with  the  difference  that  there  are  30  nodes  on  the 
path.  Message  trunk  capacity  is  50  kbps  which  is  divided  into  a  fixed 
number  of  channels  (1,  2  and  4  in  the  simulation  experiments).  There 
is  only  one  signal  channel  between  two  adjacent  nodes  with  8  kbps 
capacity  and  the  input  processes  from  the  external  sources  are  exponen¬ 
tial.  Messages  and  signals  are  of  constant  lengths;  1000  bits  and  100 
bits,  respectively.  We  used  values  of  S£  and  Wj  from  the  simulation  and 
used  Eq.  (3.7)  to  calculate  the  average  holding  time.  This  calculated 
value  was  compared  with  the  average  holding  time  obtained  from  the  simu¬ 
lation.  Fig.  3.5  shows  the  result  of  this  comparison.  In  this  figure 
when  a  curve  does  not  continue  to  the  right  beyond  a  certain  value  of  A, 
the  input  rate,  it  indicates  that  the  network  is  saturated.  The  useful 
utilization  of  links  (A/yC)  is  also  shown.  It  is  seen  that  the  network 
saturates  at  relatively  low  input  rates,  a  result  of  channel  reservation. 
It  can  also  be  seen  that  the  network  capacity  increases  as  we  increase 
the  number  of  message  channels  (with  the  same  trunk  capacity).  In  a 
later  section  of  this  chapter  we  will  show  this  phenomenon  more  explic¬ 
itly.  Fig.  3.5  shows  that  the  values  of  h  and  Wj  obtained  from  the 
simulation  fit  Eq.  (3.7)  very  closely.  This  match  is  very  encouraging; 
however,  a  complete  analytic  treatment  is  still  lacking. 


RELATIVE  ERROR 


(b)  .06  .12  .18  .24  .30  .36  .42  .48 


(a)  EXTERNAL  INPUT  RATE  (MSG/SEC) 

(b)  USEFUL  UTILIZATION 

MESSAGE  CHANNEL  CAPACITY  50  KBS 

SIGNAL  CHANNEL  CAPACITY  8  KBS 

MESSAGE  LENGTH  1000  BITS 

SIGNAL  LENGTH  100  BITS 


Figure  3.5  Accuracy  of  Analytic  Result. 
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3.4  Analysis  of  a  Path  Model  for  the  Circuit-Switching  System 

The  solution  of  the  network  model  of  the  previous  section 
required  treatment  of  a  multiple  server  queueing  system  with  non- 
Foisson  input  process  and  non-exponential  service  time;  a  system  that, 
to  date,  has  not  in  general  been  solved.  In  order  to  develop  a  model 
which  is  analytically  tractable  we  make  some  further  simplifying  assump¬ 
tions.  To  begin  with,  instead  of  a  network,  we  consider  a  path  of  a 
communication  network  (Fig.  3.6-a),  and  consider  the  traffic  between  the 
source  node  S  and  the  destination  node  D.  The  path  consists  of  n^  hops 
and  the  communication  links  between  adjacent  nodes  are  divided  into  N  ^ 
channels.  For  simplicity,  we  study  a  network  that  operates  under  the 
"forward  reservation  -  individual  signalling"  mode.  This  means,  a  sig¬ 
nal,  after  reserving  a  channel  at  a  node,  say  i,  uses  the  same  channel 
for  transmission  to  the  next  node  (i+1).  While  holding  the  channels 
reserved  between  the  source  node  S  and  node  i+  1,  the  signal  waits  at 
node  i +  1  for  a  channel  to  become  free;  at  this  time  the  signal  is  trans¬ 
mitted  to  node  i +  2  and  the  process  continues  similarly.  When  the  signal 
reaches  the  destination  node  D  a  complete  path  between  node  S  and  D  is 
set  up.  At  this  time  a  signal  (request-for-transmission)  is  sent  back 
to  node  S  through  the  reserved  channels  (hence  no  queueing  is  involved)*. 
Upon  reception  of  the  "request-for-transmission"  signal,  the  message  is 
transmitted  and  after  completion  of  the  transmission,  the  reserved  chan¬ 
nels  are  released  immediately.  The  fact  that  channels  are  reserved  from 
left  to  right  (i.e.,  there  is  an  ordering  in  the  reservation  process) 

★ 

The  RFT  signal  was  not  used  in  the  model  of  the  previous  section. 
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prevents  the  occurrence  of  deadlock  [COI'F  71 J.  Fig.  3.6-b  shows  the 
tandein  queue  model  of  the  communication  path  (Fig.  3.6-a).  In  this 


figure  each  trunk  is  abstracted  as  an  server,  FIFO  queueing  system. 
Because  of  the  reservation  process,  the  service  time  of  each  queue  is 
affected  by  waiting  times  in  the  succeeding  queues.  Let 

jL  x^  be  the  service  time  at  the  ith  queue  lj^i^n^ 

and 

s.  4— *■  ¥.  be  the  total  delay  at  the  i^  node  1  <  i  <  n, 
ri  —  —  h 

As  before,  tQ  and  t^  are  the  transmission  times  of  a  message  and  a  sig¬ 
nal,  respectively.  The  service  time  in  the  right -most  queue  (n^*'  queue) 
is 


=  t. 


*  *h  *  *0 


=  2t.  ♦  t. 


The  first  t^  corresponds  to  the  "request-for-connection"  signal  from 
node  to  node  n^+  1,  the  destination  node.  After  this  a  "request-for- 
transmission"  signal  originates  from  node  n^ +  1  and  is  sent  to  node  1. 
Because  a  complete  path  is  already  set  up,  this  transmission  takes  only 
t^  seconds.  After  reception  of  this  signal,  the  message  is  transmitted, 
which  takes  t^  seconds.  The  service  time  at  queue  n^  -  1  is  affected  by 
the  system  time  (waiting  plus  service  time)  of  the  n5^  queue,  so  we  have 


In  general,  we  have  the  following  expression  for  the  service  time  at  the 
i**1  queue: 


(3.8-a) 


+  Si*l 


♦  t. 


and  for  the  average  values  we  have 


1  <  i  <  n. 
—  n 


(3.8-b) 


x.  = 

i 


t,  +  s 
h  i+l 


1  <  i  <  n. 
—  n 


(3.9-a) 


2th  ♦  *0 


(3.9-b) 


In  order  to  simplify  the  analysis  we  accept  a  generalized  version  of  the 
independence  assumption  [KLEI  64]. 


Assumption  3.2 

The  distribution  of  service  time  at  each  of  the  queueing 
systems  shown  in  Fig.  3.6-b  is  negative  exponential  with  the 
average  values  determined  by  Eqs.  (3.9),  and  is  stochastically 
independent  of  the  service  time  of  its  succeeding  nodes. 

Assuming  that  the  input  process  of  messages  to  the  first  node 
is  Poisson,  by  virtue  of  Assumption  3.2,  the  input  process  to  all  of  the 
queues  on  the  path  will  be  Poisson  and  the  queueing  system  at  each  node 
can  be  treated  as  an  M/M/N^  queue,  being  the  number  of  servers 
(Appendix  B.l). 

To  find  the  path  delay  (the  time  between  the  arrival  of  a  mes¬ 
sage  at  the  first  node  until  the  completion  of  its  transmission),  one 
must  start  from  the  last  queue  (n^1)  and  iteratively  proceed  to  the 
first  queue;  the  system  time  of  queue  1,  5^  ,  is  the  total  path  delay. 
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Remarks 


(1)  It  has  been  shown  [CLOS  72A],  [KUMM  76A,  B]  that  in  the 
"common  channel"  signalling  method,  a  correct  signalling  channel  capac¬ 
ity  should  be  dynamically  adjusted  to  the  traffic  in  the  network. 

Because  the  emphasis  of  the  present  work  is  to  study  the  effect  of 
channel  holding  time  on  the  overall  performance,  we  based  our  study  on 
an  "individual  signalling  channel"  model,  hence  we  are  not  faced  with 
this  optimization  problem. 

(2)  Perhaps  it  would  be  more  accurate  to  make  some  simplifying 
assumptions  and  use  the  network  model  of  Section  3.3;  however,  as  we  are 
doing  a  comparison  study,  by  using  the  same  path  model  for  the  other 
switching  systems  (CTS  and  MS) ,  any  inaccuracy  which  is  the  result  of 
the  topology  will  be  common  to  all  three  systems. 

(3)  We  should  point  out  that  the  way  we  have  treated  the  hold¬ 
ing  time  distribution  is  not  accurate  in  a  narrow  sense.  Perhaps  some 
other  distributions  (e.g.,  Erlangian)  for  the  service  time  would  be  a 
better  choice;  however,  any  distribution  except  negative  exponential 
would  complicate  the  solution.  As  we  shall  see  shortly,  even  this  simple 
model  clearly  shows  the  effect  of  channel  reservation  and  holding  time. 

In  all  of  the  previous  reports,  except  in  [PORT  71],  the  queueing  system 
at  each  node  is  considered  as  an  M/M/m  system  in  which  the  average  ser¬ 
vice  time  is  the  same  as  the  average  transmission  time  of  a  message,  an 
assumption  which  is  far  from  realistic  (unless  the  model  is  for  a  fully 
connected  network);  our  model  corrects  that  defect. 
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3.5  Discussion  of  Results 

Based  on  the  model  developed  in  the  previous  section  we  present 
a  numerical  comparison  of  the  network  delay  for  the  three  switching 
systems:  CTS,  MS  and  CS.  The  network  model  is  the  one  shown  in  Fig. 
3.6-a  and  our  comparison  is  based  on  network  delay  only,  i.e.,  we  will 
disregard  the  access  delay  to  the  network,  as  this  component  of  delay 
is  common  to  all  three  systems.  A  total  trunk  capacity  of  C =  50  kbps 
is  used  and  for  CS  the  trunk  is  subdivided  into  N  ^  channels  (hence  for 
CTS  and  MS  we  consider  single  channel  links  only).  The  average  header 
(or  signal)  length  is  assumed  to  be  100  bits  and  the  average  message 
length  (1/p)  will  be  specified  for  each  case.  Regarding  the  distribution 
of  the  random  variables  involved  and  the  input  process,  we  accept  the 
assumptions  of  Section  2.1  and  also  assumption  3.2.  Lastly,  we  also' 
assume  that  the  channels  are  noiseless.  The  useful  utilization,  which 
for  simplicity  will  also  be  referred  to  as  utilization,  denoted  by 
p  =  X/pC  (X  being  the  traffic  rate  on  a  trunk) ,  is  the  fraction  of  the 
total  trunk  capacity  used  by  the  useful  information  (i.e.,  the  message). 
This  is  usually  different  from  (and  less  than)  the  effective  utilization 
pfi,  which  is  the  utilization  caused  by  the  useful  information,  the  header 
(or  signal)  and  the  holding  time  of  the  reserved  channel.  In  our  com¬ 
parison  study,  we  study  the  effect  of  four  parameters:  input  rate;  ' 
average  message  length,  1/p;  path  length,  n^;  and  the  number  of  channels 
per  line,  (this  last  parameter  is  only  important  for  CS). 

We  first  consider  the  network  delay  for  a  path  of  average 
length,  n^  -  4,  Figure  3.7-a.  The  average  message  length  (useful 
information)  is  1500  bits.  This  figure  shows  the  normalized  path 
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Figure  3.7-«  Companion  of  Switching  Syttaim. 


delay  ( =  delay/tg  =  delay  uC)  for  CTS,  MS  and  CS  when  the  trunk  is 
split  into  1,  2  and  4  channels  (for  CTS  and  MS,  is  always  1). 

Some  observations  can  be  made  on  these  curves.  As  we  already 
know  from  Chapter  2,  the  CTS  delay  is  always  smaller  than  the  MS  delay. 
For  the  particular  parameters  of  this  figure,  the  delay  of  CTS  is  always 
less  than  the  CS  delay;  however  this  is  not  always  the  case.  As  we  will 
see  in  the  following  figures,  in  some  situations  network  delay  for  CS 
becomes  less  than  the  CTS  delay. 

Comparing  CS  with  MS,  we  see  that  for  a  small  number  of  chan¬ 
nels,  N^,  at  low  input  rates  the  CS  delay  is  less  than  the  MS  delay. 
This  is  because  at  low  input  rates  the  waiting  time  to  reserve  a  channel 
is  small,  therefore,  path  set-up  time  is  small.  Once  a  path  is  set  up, 
the  message  is  transmitted  without  incurring  further  delay  in  the  inter¬ 
mediate  nodes.  On  the  other  hand,  in  MS  a  message  has  to  be  assembled 
at  all  of  the  intermediate  nodes,  a  process  that  causes  excessive  delay. 
When  the  number  of  channels  is  large  (e.g.,  Nph=4),  the  CS  channel 
capacity  is  small  and  even  though  the  path  set  up  is  negligible,  trans¬ 
mission  time  of  a  message  on  a  (small  capacity)  channel  is  excessive  and 
the  total  path  delay  becomes  larger  than  MS.  As  the  input  traffic 
increases,  Fig.  3.7-a  shows  that  CS,  with  a  small  number  of  channels  per 
link,  saturates  very  fast  (see  the  delay  curve  for  =  1)  and  the  delay 
curve  climbs  up  rapidly.  This  rapid  saturation  is  the  result  of  exces¬ 
sive  waiting  time  to  reserve  a  channel  and  as  we  already  know,  this 
delay  is  reflected  in  channel  holding  time.  By  further  increasing  the 
number  of  channels  per  link,  because  the  waiting  time  is  reduced,  the 
network  does  not  saturate  so  fast  (see  the  CS  delay  curve  for  N  ^  =  4) , 
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even  though  the  transmission  time  increases.  Comparing  this  curve  with 
the  delay  curve  of  MS  we  observe  an  interesting  behavior.  For  very 
small  input  rates,  the  MS  delay  is  less  than  the  CS  delay;  however,  as 
the  input  rate  increases,  there  is  a  cross-over  between  the  two  delay 
curves,  and  for  a  short  interval  the  CS  delay  becomes  less  than  the  MS 
delay.  For  a  further  increase  of  input  rate,  there  is  a  sharp  increase 
in  the  CS  delay  and  MS  again  becomes  better  than  CS  (from  the  delay 
point  of  view) .  The  cross-over  occurs  because  for  a  certain  range  of 
input  rates  the  combined  waiting  time  and  reassembly  delay  at  the  inter¬ 
mediate  nodes  for  MS  becomes  more  than  the  path  set-up  delay  and  message 
transmission  time  for  CS.  We  should  notice  that  the  interval  at  which 
CS  becomes  better  than  MS  depends  very  much  on  the  parameters  of  the 
system.  For  example,  for  a  small  average  message  length  (1/p  =  1000 
bits;  Fig.  3.7-b),  there  is  no  cross-over  point  for  the  MS  and  CS  delay 
curves  (for  Nc^=4).  (In  Fig.  5.7-b  CTOss-over  occurs  at  larger  values 
of  N^.)  For  comparison,  we  have  also  presented  delay  curves  for  the 
average  message  length  of  2000  bits  in  Fig.  3.7-c  and  a  longer  path 
length,  n^  =  8,  in  Fig.  3.8.  Comparison  of  Fig.  3.7-a  with  Fig.  3.8 
shows  that  for  long  path  lengths,  CS  is  more  advantageous  than  MS  in  a 
larger  region.  This  is  because  in  MS  it  is  necessary  to  assemble  the 
message  in  all  of  the  intermediate  nodes  and  so  the  network  delay  becomes 
large,  whereas  in  CS,  once  a  path  is  set  up,  there  is  no  further  delay 
due  to  intermediate  nodes. 

Before  going  further,  we  find  it  useful  to  point  out  an  unusual 
behavior  of  CS.  Consider  two  identical  queueing  systems  that  differ  only 
in  the  number  of  servers;  the  total  service  capacity  of  both  systems  are 
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Figure  3.8  Comparison  of  Switching  Systems. 
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the  same.  The  waiting  time  (time  from  the  arrival  of  a  request  until 
the  request  starts  service)  in  the  system  with  a  larger  number  of 
servers  is  always  smaller  than  the  other  one;  however,  the  total  delay 
(time  between  the  instant  of  arrival  until  the  instant  of  departure  from 
the  system)  for  the  system  with  the  smaller  number  of  servers  is  always 
less  than  the  other  [KLEI  74A] .  The  delay  curves  for  CS  do  not  show  this 
behavior.  In  Fig.  3.7-a,  as  the  number  of  channels  per  link  is  increased 
(the  total  capacity  is,  however,  kept  constant),  for  small  input  rates, 
the  system  delay  for  a  smaller  number  of  channels  is  less  than  the  delay 
for  a  larger  number  of  channels;  however,  as  the  input  rate  is  increased, 
the  delay  in  the  system  with  the  smaller  number  of  channels  becomes 
unbounded,  whereas  the  systems  with  the  larger  number  of  channels  stay 
unsaturated:  This  behavior  is  the  result  of  the  peculiar  interaction  of 
demands  for  service  from  the  system. 

Figs.  3.9-a  and  3.9-b  show  the  effect  of  message  length  on  net¬ 
work  delay.  In  these  figures  the  useful  utilization  is  kept  constant 
(p  =  0.3),  hence  the  input  rate  varies  as  the  message  length  changes 
(the  header,  or  signal,  length  is  kept  constant:  100  bits).  For  too 
small  message  lengths  (1/p  <  100  bits),  the  network  delay  for  all  three 
switching  systems  is  unbounded,  a  consequence  of  the  high  input  rate 
(to  keep  the  useful  utilization  constant).  As  the  message  length 
increases,  the  network  delay  first  decreases  and  then  grows  again.  As 
before,  we  observe  that  the  CTS  delay  is  always  less  than  the  MS  delay. 
For  CS,  after  a  sharp  decrease  (which  indicates  a  rapid  recovery  from 
saturation)  the  delay  escalates;  however  the  rate  of  increase  for  the 
delay  in  CS  is  less  than  the  rate  for  the  other  switching  systems  and 
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Figure  3.9-a  Comparison  of  Switching  System*. 


10.0  r 


AVERAGE 


eventually  the  CS  delay  becomes  less  than  the  MS  and  the  CTS  delays. 

This  phenomenon  can  be  seen  better  in  Fig.  3.9-b  where  the  path  length 
is  long.  This  figure  also  shows  that  at  longer  path  lengths  CS  becomes 
more  advantageous  in  reducing  the  network  delay.  Comparison  of  the  delay 
curves  for  CS  indicates  that  the  saturation  point  for  a  larger  number  of 
channels  per  link  is  higher  (i.e.,  the  network  can  accept  larger  input 
rates).  This  fact  is  shown  even  more  clearly  in  Fig.  3.10  where  we  have 
plotted  the  saturation  traffic  of  CS  for  different  numbers  of  channels 
per  link  as  a  function  of  the  average  message  length. 

In  Figures  3.11  to  3.13  the  delay  of  MS  is  compared  with  the 
CS  delay  at  different  utilization--message  length  combinations.  We 
consider  three  path  lengths:  short,  n^=2;  medium,  n^=4;  and  long 
path  length,  n^ =  8  (Figs.  3.11,  3.12  and  3.13,  respectively).  For  each 
path  length  we  consider  6  different  numbers  of  channels,  =  1,  2,  4, 

6,  8  and  10  (Figs.  3.11-a  to  3.11-f,  etc.).  Note  that  MS  always  uses 
the  full  capacity  of  the  trunk  and  there  is  no  splitting  involved  in  this 
case.  Again,  the  header  length  is  kept  at  100  bits  and  the  total  trunk 
capacity  is  50  kbps  in  all  cases  we  study.  In  these  figures  the  follow¬ 
ing  notation  has  been  used: 

*  :  MS  delay  <  CS  delay 

space  (blank)  :  MS  delay  >  CS  delay 

•  :  MS  delay  <  °»;  CS  delay  unbounded 

X  :  both  the  MS  and  CS  delay  unbounded 

For  a  fixed  path  length  the  area  in  which  CS  is  operational 
(the  network  is  not  saturated)  expands  with  an  increase  in  the  number  of 
channels  per  link.  This  area  is  denoted  by  or  "space"  (blank),  (the 
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Figure  3.10  Saturation  Point  of  Circuit  Switching. 
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region  designated  by  or  "X"  is  where  the  CS  delay  is  unbounded). 

For  a  given  number  of  channels,  as  the  path  length  increases,  the  area 
where  CS  is  operational  shrinks.  This  effect  can  be  seen  by  comparing, 
for  example.  Figs.  3.11-f,  3.12-f  and  3.13-f  with  each  other.  For 
short  path  lengths  a  large  number  of  channels  is  not  advantageous  to 
CS  (Fig.  3.11),  although  a  small  number  of  channels  shrinks  the  oper¬ 
ational  region  of  CS. 

To  complete  our  study,  we  have  done  the  same  type  of  comparison 
between  CTS  and  CS  in  Fig.  3.14  for  a  long  path  length,  n^=8.  As  we 
pointed  out  earlier,  CS  manifests  its  advantage  when  the  path  length  is 
long;  for  shorter  path  lengths  (n^<8),  CS  is  hardly  better  than  CTS. 
Figure  3.14  is  very  similar  to  its  counterpart  figure.  Fig.  3.13;  how¬ 
ever  the  area  more  favorable  to  CS  has  shrunk.  This  is  due  to  the  fact 

that  the  CTS  delay  is  always  less  than  the  MS  delay. 

Table  3.1  shows  the  results  of  our  comparison  study.  In  this 
table  we  have  chosen  two  intervals  for  the  message  length  (500  <_  1/p 
<_  2000  and  2000  <  1/p)  and  three  intervals  (small,  medium  and  large)  for 
the  other  three  parameters  (p,  and  n^) .  In  each  box  the  entry  in 
the  upper  left  corner  is  the  result  of  the  comparison  between  CS  and  MS 
and  the  one  in  the  lower  right  corner  relates  to  CS  and  CTS.  If  in  an 

interval  the  MS  delay  is  less  than  the  CS  delay,  then  the  CTS  delay  is 

also  less  than  the  CS  delay  and  we  have  indicated  only  "MS"  for  such  an 
interval.  In  some  cases  we  have  indicated  two  switching  systems  (e.g., 
CS  and  MS) .  This  indicates  that  the  selected  range  is  too  large  and  a 
finer  interval  should  be  specified  to  determine  a  firm  advantage  between 
the  two  systems. 
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Summarizing  our  observations,  the  following  remarks  can  be 

made: 

(1)  The  reservation  operation  in  CS  causes  a  substantial 

decrease  in  the  network  capacity.  For  this  reason  CS  is  not  a  good 

choice  when  the  traffic  is  high  (Table  3.1). 

(2)  For  large  messages  and  large  path  lengths,  if  the  utili¬ 
zation  is  not  too  high,  CS  usually  outperforms  the  other  two  switching 

systems  (Table  3.1). 

(3)  When  the  capacity  of  the  trunk  is  kept  fixed,  proper 
selection  of  the  number  of  channels  per  trunk  is  very  critical  for  the 
proper  operation  of  the  CS  system.  In  fact,  this  number  has  an  opposing 
effect  on  the  performance  of  the  network.  When  the  number  of  channels 
is  small,  then  the  network  saturates  quickly;  on  the  other  hand,  a  large 
number  of  channels  results  in  a  large  delay.  We  deal  with  the  problem 
of  optimal  number  of  channels  in  the  next  section. 

(4)  Based  on  our  study,  the  decision  as  to  which  form  of 
switching  to  use  should  be  based  on  a  careful  study  of  the  parameters 
of  the  network  and  no  general  quantitative  statement  in  this  regard  can 
be  made. 


If  the  data  stream  is  intermittent,  i.e.,  if  a  communication 
session  consists  of  an  alternating  sequence  of  message  blocks  and  idle 
periods,  depending  on  the  duration  of  the  idle  periods  and  the  message 
lengths,  holding  the  connection  during  the  idle  periods  may  be  to  the 
advantage  of  CS.  We  have  not  studied  such  cases:  however,  the  analytic 
model  developed  in  the  section  can  easily  be  modified  to  encompass 


this  situation. 

3.6  Optimal  Number  of  Channels  in  Circuit  Switching 

With  the  growth  of  telecommunication  networks  and  remote  data 
processing  facilities  there  has  been  an  increased  diversity  in  the  char¬ 
acteristics  of  the  traffic  requirement  for  these  communication  systems. 
Our  study  in  the  previous  section  indicates  that  no  single  switching 
system  can  provide  the  best  performance  for  all  types  of  traffic  require¬ 
ments  and,  in  fact,  the  best  choice  for  the  switching  technique  is  an 
integrated  system  of  different  techniques.  Integration  of  switching 
systems  has  recently  become  a  subject  of  interest  ([FISC  76],  [GERL  78], 
[PORT  76]  and  the  references  therein).  In  [PORT  76J,  the  feasibility 
of  such  an  integration  was  studied;  based  on  that  study  a  network  archi¬ 
tecture  was  proposed.  (In  fact,  the  virtual  cut-through  switching  that 
we  proposed  in  Chapter  2  was  motivated  by  the  idea  of  integrating  circuit 
switching  and  packet  switching.) 

In  this  section  we  assume  that  such  an  integration  is  feasible 
and  that  we  have  the  hardware  facility  to  integrate  both  MS  and  CS  in  a 
single  network  such  that,  depending  on  the  users'  demands  and  network 
load,  a  proper  switching  mode  can  be  selected  to  provide  the  best  grade 
of  service  (we  consider  only  MS  and  CS) .  We  address  the  following  ques¬ 
tion:  given  the  trunk  capacity  and  the  path  length,  which  switching 
technique  (MS  or  CS)  should  be  used  to  minimize  the  delay,  and  if  the  CS 
system  is  to  be  used,  what  is  the  best  choice  for  N  ^  (number  of  channels 
per  link)? 

For  the  solution  we  consider  two  cases:  dynamic  and  static 
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trunk  splitting.  In  the  first  case,  the  number  of  channels  per  link  can 

be  dynamically  adjusted  according  to  the  load  pattern  in  the  network  in 

order  to  minimize  the  network  delay.  In  the  static  case,  however,  once 

the  value  of  N  ,  is  chosen,  it  cannot  be  changed.  ' 

ch 

Case  I:  Dynamic  Trunk  Splitting 

Dynamic  trunk  splitting  can  be  done  by  synchronous  time  division 
multiplexing  techniques  (STDM);  we  do  not  address  the  implementation 
issues.  We  assume  that  the  number  of  channels  per  link  can  be  adjusted 
in  order  to  minimize  the  network  delay.  If  it  turns  out  that  MS  results 
in  a  smaller  network  delay,  then  the  total  trunk  capacity  is  dedicated 
to  message  switching.  The  result  of  such  an  optimal  channel  splitting 
and  switching  system  selection  is  shown  in  Figs.  3.15-a  through  3.15-c 
where  we  show  results  for  three  different  path  lengths.  For  an  "average 
message  length  -  useful  utilization"  combination,  if  CS  results  in  a 
smaller  delay,  the  optimum  number  of  channels  (modulo  10)  is  printed. 

If  CS  does  not  give  a  smaller  delay  than  MS,  then  that  point  is  left 
blank.  The  areas  designated  by  "X",  as  in  the  previous  figures,  are 
where  both  MS  and  CS  delays  are  unbounded.  Fig.  3.15  shows,  for  a  given 
useful  utilization,  that  CS  is  preferable  for  larger  message  lengths  and 
MS  is  advantageous  for  shorter  message  lengths.  It  should  be  noted,  for 
some  values  of  the  utilization  (for  example  p>0.6)  that  MS  is  the  only 
choice,  as  the  CS  delay  is  unbounded  for  these  values  of  trunk  utiliza¬ 
tion.  On  the  other  hand,  for  a  given  average  message  length,  smaller 
utilizations  favor  CS;  for  large  utilizations  MS  is  a  better  choice  for 
switching. 
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Figure  3.16-c  Dynamic  Channel  Splitting  -  Optimal  Number  of  Channel*  for  CS. 


In  static  trunk  splitting,  the  number  of  channels  per  link 
is  chosen  for  all  possible  values  of  average  message  length  and  useful 
trunk  utilization.  Because  our  selection  is  based  upon  the  delay  com¬ 
parison  with  MS,  we  choose  N  ^  such  that,  for  a  given  path  length,  in 
the  p,  1/u  plane  (1/p  is  the  average  message  length),  the  region  where 
CS  is  superior  to  MS  becomes  maximal.  For  example,  for  n^=4.  Fig.  3.12 
shows  that  =  4  is  a  better  choice  than  =  2  or  1  (actually  the  best 
choice  in  this  case  is  Nc^=3,  see  Fig.  3.16).  Fig.  3.16  shows  the 
values  of  N  ^  chosen  according  to  this  criterion,  where  we  have  assumed 
the  following  ranges  for  the  utilization  and  for  the  average  message 
length:  0  £  p  <_  1  and  500  <  1/p  <_  20,000.  This  figure  shows,  for  static 
channel  splitting,  that  loneer  oath  lengths  require  a  larger  number  of 
channels  per  link.  Once  a  value  for  N  ^  is  chosen,  the  operating  rule 
regarding  where  to  change  the  switching  mode  is  determined  by  using  a 
graph  such  as  Fig.  3.12. 

We  point  out  that  our  treatment  of  the  problem  is  not  rigorous 
since  we  have  implicitly  assumed  that  all  values  of  the  variables  occur 
with  the  same  likelihood  (consider  for  example  a  system  for  which  most 
messages  are  of  short  length).  For  a  more  exact  solution,  one  should 
impose  proper  weight  factors  on  different  intervals  of  the  range  of  the 
variables . 

3.7  Conclusion 

In  this  chapter  we  first  presented  a  mathematical  model  for  the 
delay  performance  of  circuit  switching  which  incorporated  the  effect  of 
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Figure  3.16  Static  Channel  Splitting  -  Optimal  Number 
of  Channels  for  CS. 


reservations.  By  using  this  model  we  carried  out  a  comparison  study 
between  the  delay  performance  of  three  switching  systems:  circuit 
switching,  message  switching  and  cut-through  switching.  Our  study  showed 
that  the  boundary  between  the  areas  of  relative  effectiveness  of  these 
switching  techniques  depends  greatly  on  the  network  topology  (which  was 
reflected  in  the  path  length),  the  message  length  and  the  useful  utili¬ 
zation.  For  the  circuit-switched  system,  the  number  of  channels  per 
trunk  is  also  an  important  parameter.  While  this  study  confirmed  the 
previous  understandings,  it  also  quantitatively  showed  the  effect  of  some 
parameters  (e.g.,  the  path  length)  which  have  so  far  been  disregarded. 
Based  on  our  studies,  when  the  number  of  channels  per  trunk  in  CS  is 
properly  selected,  CS  is  a  better  choice  of  switching  system  than  MS 
and/or  CTS  at  low  utilizations  or  large  message  lengths.  When  the  com¬ 
munication  path  is  long,  again,  CS  outperforms  the  other  two  switching 
systems.  In  general,  the  decision  on  the  choice  of  switching  method 
depends  on  the  parameters  of  the  system,  and  no  definite  quantitative 
statement  in  this  regard  can  be  made.  Our  study  in  Section  3.6  showed 
that  the  best  choice  is  to  integrate  different  switching  techniques  in 
a  single  network  load  so  that  according  to  the  user  demands  and  the  net¬ 
work  load,  a  proper  switching  mode  can  be  selected  to  provide  the  best 
grade  of  service. 
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CHAPTER  4 


STATIC  PLOW  CONTROL  IN 
STORE- AND- FORWARD  COMPUTER  NETWORKS 


The  last  two  chapters  were  concerned  with  the  switching  problems 
in  computer-based  communication  networks.  In  this  chapter  and  the  next 
we  focus  on  the  analysis  of  flow  control  techniques  in  such  networks. 

A  computer  network  is  a  collection  of  resources  to  be  used  by 
competing  users.  Resources  of  such  networks  can  be  categorized  in  two 
classes  [POUZ  76B] : 

•  Basic:  buffers,  transmission  bandwidth,  processor  time 
•Incidental:  name  space,  table  entries,  logical  channels,  etc. 
"Users"  are  the  messages  to  be  transmitted.  The  collection  of  resources 
has  a  limited  capacity  which  causes  conflicts  to  occur  among  the  users 
of  the  system.  These  conflicts  may  result  in  a  degradation  of  perfor¬ 
mance  of  the  system  to  the  point  that  the  system  becomes  clogged  and  the 
throughput  goes  to  zero.  This  behavior  is  typical  of  "contention"  sys¬ 
tems  which  are  described  by  Agnew  [AGNE  74]  as  follows: 

"Congestion-prone  systems  include  queues,  where  long  waiting 
times  accompany  nearly  maximal  throughput,  and  'contention'  systems 
where  the  throughput  goes  to  zero  and  the  waiting  time  to  infinity 
for  large  enough  loads.  In  a  queue-like  system  congestion  is  typi¬ 
cally  the  result  of  uneven  or  uncertain  behavior  by  users  and  ser¬ 
vers.  While  this  behavior  is  also  present  in  a  contention  system, 
additional  user-user  and  user-server  interaction  help  to  cause  the 
declining  throughput..." 

Networks  cannot  afford  to  accept  all  the  traffic  that  is  offered 
without  control;  there  must  be  rules  which  govern  the  acceptance  of 
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traffic  from  outside  and  coordinate  the  flow  inside  the  network.  These 
rules  arc  commonly  known  as  flow  control  procedures.  A  more  precise 

definition  of  flow  control  is  given  by  Pouzin  [POUZ  76B] : 

Definition  4.1 

Flow  control  is  the  set  of  mechanisms  whereby  a  flow  of 
data  can  be  maintained  within  limits  compatible  with  the  amount 
of  available  resources. 

In  order  to  keep  the  network  traffic  within  desirable  limits, 
flow  control,  among  other  things,  should  be  equipped  with  mechanisms  to 
throttle  some  resources.  The  main  throttling  tools  are  [POUZ  76B] : 
stop-and-go,  where  a  sender  continues  transmission  until  it  is  signalled 
not  to;  credit,  where  the  sender  receives  an  indication  giving  the  amount 
of  data  it  can  transmit;  rate,  where  the  sender  transmits  traffic  at  a 
predetermined  rate;  delay,  where  the  sender’s  transmissions  are  slowed 
down  by  excessive  network  delay;  and  class,  where  the  messages  carry 
with  them  a  class  indicator  and  in  case  of  congestion  certain  classes 
are  denied  entry.  These  tools  have  been  identified  as  the  "basic  throt¬ 
tling  tools"  as  opposed  to  the  "incidental  throttling  tools"  which  are 
imposed  on  incidental  resources  [POUZ  76B] . 

Existing  control  methods  in  store- and- forward  communication 
nets  can  be  classified  as  local  control  and  global  control.  Local  con¬ 
trol  is  applied  by  a  communication  processor  within  the  subnet  on  the 
basis  of  its  own  as  well  as  its  immediate  neighbors'  traffic  data  and 
resource  utilization.  This  type  of  control  is  a  direct  consequence  of 
limited  buffer  space  in  each  node  and  is  basically  useful  for  prevention 
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of  local  congestion.  If  the  users  who  chiefly  contribute  to  an  overload 
are  distant  from  the  point  of  congestion,  local  control  methods  require 
that  congestion  measures  spread  over  long  distances  before  effective 
measures  are  taken  [DAVI  71].  Thus  local  control  is  not  by  itself  suf¬ 
ficient  to  prevent  congestion,  and  global  control  is  necessary  in  order 
to  further  stop  the  input  to  the  network  well  before  the  network  is 
loaded  to  saturation.  This  control  can  be  accomplished  by  limiting  the 
number  of  packets  which  are  simultaneously  in  the  network.  Existing 
methods  of  global  flow  control  are:  isarithmic  flow  control  [DAVI  71] 
used  in  the  NPL  network,  in  which  the  total  number  of  credits  in  a  net¬ 
work  are  controlled  and  kept  constant,  and  end-to-end  flow  control 
[MCQU  72]  used  in  the  ARPANET,  where,  basically,  the  total  number  of 
credits  between  two  users  are  limited. 

Most  end-to-end  flow  control  mechanisms  use  a  variant  of  the 
credit  throttling  technique  and  are  usually  described  in  terms  of  window 
size  [CERF  74],  [BELS  75],  where  the  number  of  unacknowledged  messages 
(or  packets)  are  limited  to  the  "window  size."  A  variant  of  this  mech¬ 
anism  was  originally  used  in  the  French  Cyclades  network  [ZIMM  75]  and 
the  ARPANET  VDH  (Very  Distant  Host)  connections  [BBN  73]. 

End-to-end  flow  control  is  accomplished  through  inter-process 
communication  protocols  and  any  attempt  to  quantitatively  study  the 
former  should  start  with  the  development  of  an  analytic  model  for  the 
latter.  Several  authors  have  discussed  techniques  for  flow  control  at 
different  levels  [CARR  70],  [DAVI  71],  [KAHN  71],  [WALD  72],  [OPDE  74], 
[BELS  75],  [CROW  75],  [DANT  75A,B],  [ZIMM  75],  [KLEI  76B]  and  [POUZ  76B] 
However,  due  to  the  complex  multi-variate  environment  of  flow  control. 


125 


r  1  ■■  ■  .  . . . — - 

quantitative  and  analytic  results  have  been  lacking.  Except  for  simu¬ 
lation  studies  of  flow  control  [PRIC  73],  [GIES  76]  and  [LELA  76],  in 
most  analytic  works  the  authors  have  focused  on  the  influence  of  the 
individual  tools  of  flow  control  on  the  overall  performance  [PENN  75], 
[SUNS  75],  [CHAT  76],  [CHOU  76],  [l’AYO  77]  but  in  general,  due  to  the 
complexity  of  the  problem,  the  interaction  of  these  tools  has  not  been 
analytically  studied. 

The  purpose  of  this  chapter  is  to  develop  analytic  models  for 
end-to-end  communication  protocols  and  to  study  the  window  mechanism 
for  flow  control  in  store-and- forward  computer  based  communication  net¬ 
works.  We  start  with  a  very  simple,  deterministic  model  in  which  there 
are  no  stochastic  fluctuations  in  the  load  on  the  system.  Then,  step- 
by-step,  we  improve  this  basic  model  and  develop  a  static  flow  control 
model  in  which  the  parameters  of  the  system  are  not  dynamically  adjusted 
to  the  stochastic  fluctuations  of  the  system  load.  We  will  use  the 
static  flow  control  results  in  the  next  chapter  to  develop  a  dynamic 
flow  control  system. 

4.1  The  Model 

As  in  the  previous  chapters,  we  continue  to  assume  that  messages 
consist  of  single  packets;  hence  we  do  not  differentiate  between  message 
switching  and  packet  switching.  Because  the  present  study  is  concerned 
with  the  analysis  of  end-to-end  protocols,  we  consider  a  communication 
session  between  a  source  and  destination  (say  nodes  A  and  BJ  in  a  com¬ 
munication  network.  The  environment  under  consideration  is  shown  in 
Fig.  4.1.  In  this  figure  the  traffic  controller  (TC)  in  the  source  node 
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is  responsible  for  regulating  the  traffic  from  node  A  to  node  B.  It 
allows  a  maximum  of  w  (the  window  size)  unacknowledged  messages  from 
the  sender  to  be  present  in  the  network.  The  messages  are  sent  toward 
the  destination  node  and  a  copy  of  them  is  retained  in  the  buffer  space 
provided  by  the  TC.  When  the  TC  receives  an  acknowledgement  (ACK)  for 
a  message,  it  discards  the  copy  of  the  message  it  is  storing  and  accepts 
a  new  message  from  the  sender.  In  the  destination  node  messages  which 
are  received  and  can  be  accepted  are  transmitted  to  the  receiver  and 
ACKs  for  them  are  sent  back  to  the  TC  (later  we  will  explain  why  a  mes¬ 
sage  may  not  be  accepted).  We  also  assume  that  the  traffic  on  the  path 
is  symmetric  (i.e.,  there  is  the  same  flow  of  data  from  the  receiver  to 
the  sender)  and  that  the  ACKs  are  piggybacked  on  the  messages,  hence 
there  is  no  extra  traffic  due  to  ACKs.  We  also  assume  that  the  sender 
is  fast  compared  to  the  network  so  that  whenever  the  TC  can  accept  a 
new  message,  the  sender  has  one  ready  for  transmission.  With  these 
assumptions,  the  throughput  is  mainly  determined  by  the  network  and 
throughout  our  study  we  will  be  concerned  with  the  maximum  throughput- 
delay  behavior  of  the  network  within  the  boundaries  shown  in  Fig.  4.1. 
Maximum  throughput  is  the  maximum  rate  at  which  the  network  can  accept 
new  messages  (hence  it  is  the  upper  bound  of  the  acceptable  input  rate 
to  the  network).  For  simplicity,  when  there  is  no  ambiguity,  we  use 
"throughput”  to  indicate  "maximum  throughput."  The  end-to-end  delay 
(which  we  will  also  refer  to  as  the  total  delay)  is  the  delay  a  mes¬ 
sage  experiences  from  the  time  it  is  submitted  to  the  TC  until  it  is 
delivered  to  the  receiver;  it  consists  of  the  network  delay,  the  delay 
incurred  in  the  TC  and  the  (sub) network,  plus  the  destination  node  delay 
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(note  that  the  end-to-end  delay  does  not  include  the  admission  delay  to 


the  system).  Another  delay  measure  which  is  important  in  our  study  is 
the  acknowledgement  delay  (T  )  which  is  the  period  from  the  time  a  mes- 
sage  is  delivered  to  the  (sub)network  until  its  ACK  is  received  by  the 
TC.  We  point  out  that  the  ACK  delay  is  equal  to  the  round-trip  delay 
only  if  the  message  transmission  is  successful.  If  a  message  is  lost 
in  the  network  (due  to  a  transmission  error  and/or  rejection  at  the  des¬ 
tination  node;  see  Section  4.4.1)  no  ACK  for  the  message  is  received 
(hence  T  ,  the  acknowledgement  delay,  is  infinite). 

We  have  not  specified  the  timeout  and  certain  other  mechanisms 
of  the  system.  We  elaborate  on  these  operations  as  the  need  arises  in 
later  sections.  In  a  store-and-forward  computer  network,  since  packet 
sequences  may  be  received  out  of  order,  the  destination  node  must  be 
equippe^j  with  a  mechanism  to  deliver  packets  or  messages  in  correct 
order.  This  mechanism  can  be  implemented  in  different  ways:  the  desti¬ 
nation  node  may  reject  out-of-order  packets  or  messages;  or  it  may  store 
the  out-of-order  packets  and  messages,  but  deliver  them  in  the  original 
order  to  the  receiver.  The  first  solution  results  in  a  waste  of  network 
channel  bandwidth  (because  of  retransmissions),  whereas  the  second  solu¬ 
tion  results  in  an  excessive  buffer  storage  requirement  at  the  destina¬ 
tion  node.  The  sequencing  problem  is  more  important  for  the  case  of 
multi-packet  messages.  Because  of  our  assumption  of  single  packet  mes¬ 
sages,  the  sequencing  problem  is  not  as  critical  in  our  study  and  we  will 
ignore  this  issue.  In  (SUNS  75]  the  effect  of  sequencing  on  the  network 
performance  has  been  partially  investigated.  Lastly,  due  to  the  single 
packet  message  assumption,  there  will  not  be  any  case  of  reassembly 
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deadlock  in  the  system  we  study  and,  in  rare  situations,  only  store- 
and-forward  deadlock  may  occur  [KAHN  71],  [OPDE  74];  therefore,  we  will 
ignore  the  deadlock  problem  as  well. 

4.2  A  Fluid  Approximation 

We  begin  our  study  with  the  analysis  of  a  purely  deterministic 
model.  We  assume  that  the  random  variables  involved  are  deterministic, 
hence  this  model  represents  a  fluid  approximation  of  the  original  system 
[NEWE  68],  [KLEI  76B] .  We  also  assume  that  the  network  is  completely 
reliable,  which  means  that  the  channels  are  noiseless  and  for  every  mes¬ 
sage  delivered  to  the  (sub)network  by  the  TC,  an  ACK  is  received  (with 
probability  one).  We  use  the  following  notation: 

Ta  ACK  delay  (sec) 

1/p  Message  length  (bits) 

Cj  Network  channel  capacity  (bps) 

Xj  Message  transmission  time  on  a  channel  (=  1/pC^)  (sec) 

Yj  Maximum  discharge  rate  of  messages  (  =  pC^)  (msg/sec) 

A*  Maximum  throughput  (or  acceptance  rate)  of  the  network  (msg/sec) 
X  Input  rate  of  messages  to  the  network  (msg/sec) 
w  Window  size  (msg) 

Consider  Fig.  4.2;  if  at  time  tQ  a  message  is  delivered  to  the 
network,  its  ACK  will  be  received  at  time  t.(  =  t_  +  T  ).  Because  every 

1  U  3 

T  seconds  one  message  is  accepted  by  the  network,  for  a  window  size 
equal  to  1  (w =  1),  the  maximum  throughput  becomes  1/T  and  in  general 
for  a  window  size  of  w  the  maximum  throughput  is  w/T  .  Fig.  4.3  shows 
a  sketch  of  the  maximum  throughput  as  a  function  of  the  window  size. 
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Because  the  throughput  cannot  exceed  Yj .  the  maximum  discharge  rate,  the 
curve  levels  off  when  the  window  reaches  a  critical  size  (w*) .  Summing 
up,  we  have  the  following  relationship  between  the  maximum  throughput 
and  the  window  size: 


!w/T  w  <  w* 

a 

Yj  w  >  w* 


(4.1) 


where  w*  =  T  y, •  Note  that  A*  is  the  maximum  throughput  that  the  net- 
work  can  carry;  the  input  rate  (A)  is  bounded  above  by  A*. 

In  order  to  obtain  a  quantitative  value  for  the  acknowledgement 
delay  we  make  some  assumptions  on  the  structure  of  the  network.  Fig. 

4.4  shows  a  communication  path  of  n^  hops  between  source  A  and  desti¬ 
nation  B.  Because  the  transmission  delay  on  each  channel  is  Xj,  the 
acknowledgement  delay  becomes 


T 

a 


2Vi 


2nh/y, 


and  Eq.  (4.1)  reduces  to 


wheie  w*  =  2n,  . 

h 


wY,/2n,  w  <  w* 

1  h  — 

Yj  w  >  w* 


(4.2) 


Although  this  model  is  very  crude,  it  still  reveals  some  inter¬ 
esting  characteristics  of  the  system.  In  particular  it  shows  that  the 
traffic  and  the  throughput  of  the  system  can  be  controlled  by  the  window 
mechanism.  Furthermore,  it  is  not  beneficial  to  increase  the  window 
size  beyond  a  certain  value.  Considering  the  fact  that  a  large  window 
size  means  a  large  number  of  buffers  in  the  traffic  controller  and  that 
the  buffer  storage  is  a  cost  item  in  the  overall  network  cost,  the 
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uselessness  of  a  large  window  size  becomes  clear.  Regarding  the  best 
window  size,  w*,  Fig.  4.3  shows  w  should  be  chosen  such  that  there  is 
one  message  on  each  channel  (or  node);  in  other  words,  one  should  try  to 
fill  the  pipe  of  channels  along  the  path. 

4.3  A  Simplified  Stochastic  Analysis 

We  improve  the  accuracy  of  the  model  developed  in  the  last  sec¬ 
tion  by  permitting  some  fluctuations  in  the  arrival  process  and  message 
lengths.  We  accept  the  following  assumptions: 

Assumption  4.1 

1.  External  Poisson  arrivals  to  the  traffic  controller 
(with  rate  X) 

2.  Exponentially  distributed  message  lengths  (with  mean  1/p) 

Although  external  traffic  is  accepted  to  the  network  by  the  traffic 
controller's  demand  and  the  data  stream  is  not  continuous,  we  assume 
that  the  overall  process  of  the  external  message  arrival  to  the  TC  is 
Poisson.  Section  2.6.1  contains  a  detailed  justification  of  the  second 
assumption. 

Except  for  the  above,  we  continue  to  accept  the  assumptions 
made  in  the  last  section.  In  particular  we  assume  that  the  network  is 
reliable,  that  the  channels  are  noiseless  and  that  the  destination  node 
accepts  all  of  the  received  messages;  hence  no  retransmission  is  neces¬ 
sary  (more  about  retransmission  in  Section  4.4). 

When  the  flow  in  the  network  is  stabilized,  the  input  rate  to 
all  of  the  nodes  (Fig.  4.4)  becomes  identical.  By  accepting  the  inde¬ 
pendence  assumption  of  Kleinrock  [KLEI  64],  we  are  in  a  position  to  find 
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the  acknowledgement  delay  (which  is  the  same  as  the  round-trip  delay) 
and  the  throughput.  By  virtue  of  the  independence  assumption  each  chan¬ 
nel  on  the  path  behaves  like  an  M/M/1  queueing  system.  There  are  2nj 
channels  on  a  round-trip,  therefore 

2nh 

Ta  '  —  <n  •  t4-3) 

where  1/ (Yj  -  *)  is  the  average  system  time  of  an  M/M/1  system  with 
service  rate  y^  and  input  rate  A.  On  the  average  each  Ty  seconds  a 
maximum  of  w  messages  can  be  accepted  to  the  system;  therefore,  the  max¬ 
imum  input  rate  becomes 


(4.4) 


For  an  input  rate  A  =  A*,  we  can  solve  Eqs.  (4.3)  and  (4.4)  for  A*  to  get 


and  also 


w  +  ?n 


(4.5) 


(4.6) 


The  network  delay  (which  is  the  delay  of  the  one-way  trip  from  the  TC 
to  the  destination  node)  is  half  of  the  round-trip  delay,  so 

w  +  2n. 

T*  =  -yY---  Yj  >  0  and  w  >  0 


When  the  window  size  (w)  and/or  channel  transmission  rate  (y^)  is  zero 
there  is  no  traffic  flowing  through  the  network  and  we  define  the  delay 
to  be  zero;  therefore  we  have 
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(4.7) 


T* 


w  ♦  2n, 
n 


0 


Yj  >  0  and  w  >  0 


Yj=0  or  w=0 


T*  does  not  include  the  admission  delay  to  the  network  and  the  destina¬ 
tion  node  delay  (the  delay  messages  incur  in  the  destination  node  to  be 
sent  to  the  receiver). 

We  point  out  that  the  window  size  w,  in  the  way  being  used,  is 
actually  the  average  number  of  messages  in  the  network.  This  becomes 
clear  when  we  rewrite  Eq.  (4.4)  as 

w  =  A*T* 
a 

By  Little's  result  [LITT  61]  w  is  the  average  number  of  messages  in  the 
network  when  the  traffic  rate  is  A*.  Therefore,  in  a  narrow  sense,  in 
our  model  the  average  number  (rather  than  the  actual  number)  of  messages 
in  the  network  is  kept  fixed. 

Figs.  4.5-a  and  4.5-b  show  a  sketch  of  the  normalized  through¬ 
put  (A*/Yj)  and  the  normalized  delay  (T*y^)  as  a  function  of  the  window 
size.  For  comparison  purposes  we  have  also  shown  the  throughput  curve 
for  the  deterministic  model  in  Fig.  4.5-a.  It  is  seen  that  the  (maximum) 
throughput  asymptotically  approaches  its  limiting  value  and  the  delay 
gradually  (in  fact,  linearly)  increases  to  infinity.  This  figure 
depicts  the  effectiveness  of  the  window  mechanism  in  controlling  the 
traffic  more  explicitly.  Because  the  number  of  unacknowledged  messages 
is  limited  to  w,  when  the  network  delay  increases  (because  of  stochastic 
fluctuation),  the  input  rate  is  decreased  (Eq.  4.4)  to  keep  the  total 
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NORMALIZED  NETWORK  DELAY  (T*^)  NORMALIZED  MAXIMUM  THROUGHPUT 


DETERMINISTIC  MODEL 


STOCHASTIC  MODEL 


(a)  Maximum  Throughput 


Figurt  4.5  -  Skatch  of  Maximum  Throughput  and  Daiay  at  a  Function  of  Window  Size 


number  of  messages  in  the  system  constant.  This  fact  becomes  more  clear 
if  we  consider  the  maximum  allowable  window  size  for  a  given  delay. 

From  the  analysis  of  Section  4.2  we  know 


T* 

w  =  -* 
max  y 

*1 

Xj  being  the  average  transmission  time  of  a  message  on  a  channel  when 
there  is  no  queueing  delay.  Using  the  value  of  T*  as  given  by  Eq.  (4.6) 
and  considering  the  fact  that  =  l/yC^  =  1/y^,  we  have 


w 

max 


w  +  2n, 
n 


This  last  relationship  indicates  that  the  window  size  is  always  less 
than  the  maximum  allowable  window,  hence  the  system  never  "saturates," 
i.e.,  channel  utilization  never  reaches  1  (of  course,  as  long  as  w <  °°) . 

The  expression  for  the  network  delay,  Eq.  (4.7),  can  be 
rewritten  as 


T*  =  —  + 

Y, 


w 

2Y 


-=■  .  w  -7 

=  nhX!  2  X1 


The  first  term,  n^Xj,  represents  the  transmission  delay  and  the  second 
term  can  be  viewed  as  the  queueing  delay  caused  by  the  background  traf¬ 
fic.  Note  that  w  is  the  total  number  of  messages  on  the  path,  hence  w/2 
is  the  number  of  messages  that  are  being  sent  from  node  A  to  node  B. 

We  can  also  rewrite  Eq.  (4.5)  as  follows: 


X*  ~  V 

**  ♦  2  Y1 


where  =  w/n^  is  a  measure  of  the  load  in  a  node  (more  about  that  in 
Chapter  6) .  We  note  that  if  w  and  n^  both  vary  such  that  the  ratio  w/n^ 


does  not  change,  the  throughput  of  the  network  does  not  vary  either; 
however,  the  network  delay  changes.  This  fact  shows  that  from  the 
throughput  point  of  view,  all  networks  with  the  same  1*  are  identical. 

A*  and  T*  given  by  Eqs.  (4.5)  and  (4.7)  are  the  maximum  through¬ 
put  and  the  corresponding  network  delay  that  the  network  can  carry  when 
the  window  size  is  w.  For  an  input  rate  of  A  <  A*  the  network  delay 
becomes 

nh 

T  =  -  -  n  Y  0  <  A  <  A*  (4.8) 

yi  '  A  "  ” 

If  the  input  rate  is  larger  than  A*,  then  the  extra  traffic  is  not 
admitted  to  the  net.  Fig.  4.6-a  shows  a  sketch  of  the  maximum  through¬ 
put  as  a  function  of  w,  and  Fig.  4.6-b  shows  the  throughput-delay 
behavior  of  the  system.  If  the  window  size  is  set  to  w^,  then  the  max¬ 
imum  throughput  is  A*.  When  the  input  rate  is  A  <  A*  (therefore,  not 
all  of  the  window  buffers  are  used),  the  network  delay  is  determined  by 
Eq.  (4.8).  For  larger  window  sizes  the  maximum  input  rate  increases 
(w*  and  w^  in  Figure  4.6-a),  and  the  maximum  network  delay  increases  as 
well  (Fig.  4.6-b). 

The  question  which  now  arises  is,  what  is  the  best  choice  for 
the  window  size.  The  deterministic  model  of  Section  4.2  indicated  that 
a  window  size  of  w >  20^  gives  the  best  throughput,  and  that  the  best 
choice  for  w  is  2n^.  In  order  to  answer  this  question  for  the  stochastic 
model  of  this  section,  we  should  first  define  our  measure  of  "goodness." 
If  we  favor  high  throughput,  then  a  large  window  size  is  the  best,  but 
this  results  in  a  large  delay  as  well  (Figs.  4.5-a  and  4.5-b).  On  the 
other  hand,  if  delay  is  our  measure  of  performance,  then  a  window  of 
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MAXIMUM  THROUGHPUT  (X  •) 


X*  xlxj  THROUGHPUT  (X) 


Figure  4.6  •  Sketch  of  Throughput,  May  and  Power  of  a  Network. 
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size  0  is  the  best  choice  (the  throughput  for  this  window  size  is,  how¬ 
ever,  zero).  In  order  to  combine  both  throughput  and  delay,  we  need  to 
define  a  proper  performance  measure  which  reflects  the  opposing  effects 
of  throughput  and  delay.  To  this  end,  we  use  the  ratio  of  throughput 
and  delay  as  our  measure  of  performance.  This  ratio,  first  defined  by 
[(10 IS  76]  and  referred  to  by  them  as  power  (.^),  is  a  suitable  measure 
of  performance  because  both  high  throughput  and  low  delay  characterize 
efficiency  of  networks.  The  name  "power"  is  used  because  of  the  simi¬ 
larity  between  this  measure  and  the  power  (energy/time)  used  in  physics. 
In  Fig.  4.6-c  a  sketch  of  the  power  of  a  network  with  path  length  and 
channel  transmission  rate  of  y  is  shown  as  a  function  of  its  input  rate. 
(Note  that  X  in  this  figure  is  the  uncontrolled  input  rate).  Figs. 

4.6-b  and  4.6-c  show  some  interesting  properties  of  power.  When  the 
input  rate  is  either  zero  or  equal  to  y,  the  power  is  zero.  As  X 
increases  from  0  to  y,  &  reaches  a  peak,  which  corresponds  to  the  point 
in  Fig.  4.6-b  where  a  line  drawn  from  the  origin  is  tangent  to  the  delay 
curve  (X*  is  Figure  4.6-b).  This  value  of  input  rate  is  a  "good"  oper¬ 
ating  point  for  the  system  in  the  sense  that  in  the  interval  0  X  <  X* 
the  rate  of  increase  of  delay  is  less  than  X.  However,  in  the  interval 
of  X'v  <  X,  T  increases  faster  than  the  input  rate;  therefore,  the  partic¬ 
ular  input  rate  which  maximizes  the  power  is  a  good  choice  of  X.  (This 
fact  becomes  more  clear  when  power  has  a  sharp  peak.)  When  the  input  to 
a  network  is  controlled  by  the  window  mechanism,  we  can  use  Eqs.  (4.5) 
and  (4.7)  to  find  the  power  for  the  maximum  input  rate  (X*). 

AX* 

*  “  T*  "  (w+  2nh)/2y 
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4.4  A  Generalized  Stochastic  Model 

To  improve  the  model  of  Section  4.3,  we  make  futher  assumptions 
regarding  the  operation  of  different  parts  of  a  network.  More  specif¬ 
ically  we  elaborate  on  the  following  issues: 

•  The  acknowledgement  and  timeout  strategy 

•  The  (sub)network  delay  distribution 

•  The  destination  node  structure 

4.4.1  The  Acknowledgement  and  Timeout  Strategy 

So  far,  we  have  assumed  that  once  the  TC  delivers  a  message  to 
the  (sub)network,  the  message  is  guaranteed  to  be  received  and  accepted 
by  the  destination  node  buffer  and  is  acknowledged  for  correct  reception. 
This  assumption  implies  that  a  message  has  to  be  transmitted  only  once, 
hence  the  input  rate  of  external  traffic  to  the  TC  is  equal  to  the  out¬ 
put  rate  of  messages  from  it.  In  real  life  there  are  always  chances 
that  the  acknowledgement  for  a  message  does  not  return.  This  happens 
for  a  variety  of  reasons,  some  of  which  we  list  below: 

•  Network  channels  are  not  noiseless  and  a  message  may  arrive  in 
the  destination  node  with  some  bits  in  error.  Such  messages 
are  not  accepted  by  the  destination  node. 

•  Communication  processors  in  a  network  have  finite  buffer  stor¬ 
age  for  store-and- forward  (in  the  intermediate  nodes)  and 
reassembly  (in  the  destination  node).  When  the  buffer  storage 
is  full,  no  further  messages  can  be  accepted. 
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•In  the  case  of  multipacket  messages,  if  a  packet  is  received 
out  of  order,  it  is  not  accepted  by  the  destination  node  (this 
of  course  depends  on  the  protocols  being  used,  as  in  the 
ARPANET  [KLEI  76B] -  See  also  section  4.1). 

In  all  of  the  above  cases  no  acknowledgement  is  sent  back  to 
the  TC.  If  a  TC  keeps  waiting  to  receive  an  ACK  for  a  message,  chances 
are  that  it  will  never  receive  such  an  ACK  and  the  copy  of  the  unac¬ 
knowledged  message  is  kept  forever  in  its  buffer  space  and  (because  the 
number  of  unacknowledged  messages  is  limited)  it  cannot  accept  new  exter¬ 
nal  traffic;  consequently  the  network  throughput  degrades.  To  prevent 
this  degradation,  traffic  controllers  are  equipped  with  a  timeout  mecha¬ 
nism  such  that  when  the  ACK  for  a  message  is  not  received  within  a 
certain  period  (the  timeout),  the  TC  retransmits  the  message.  This 
process  is  continued  until  an  ACK  is  received  by  the  TC,  at  which  time 
the  copy  of  the  message  is  dropped  from  the  traffic  controller's  buffer 
and  a  new  message  can  be  accepted.  Fig.  4.7-a  shows  the  structure  of 
the  TC.  After  delivering  a  message  to  the  (sub)network,  the  TC  stores 
a  copy  of  the  message  in  its  timeout  box.  If  an  ACK  for  this  message  is 
received  within  t  seconds  (the  timeout  period),  the  message  is  dropped; 
else  the  message  is  retransmitted.  As  shown  in  this  figure,  the  delivery 
of  messages  from  the  TC  to  the  (sub)network  is  instantaneous  and  there 
is  no  queueing  delay  involved.  Because  of  retransmission,  the  acceptance 
rate  of  messages  from  outside  (A)  is  different  from  (and  less  than)  A  , 
the  message  delivery  rate  to  the  (sub) network. 

We  should  point  out  that  if  an  ACK  does  not  arrive  during  the 
timeout  period,  that  does  not  imply  that  it  will  never  be  received. 
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Also,  due  to  the  randomness  of  the  round-trip  delay,  ACK's  for  dif¬ 
ferent  (re)transmissions  of  a  message  may  return  out  of  order.  Consider 
the  following  cases: 

I.  After  the  first  timeout  the  TC  retransmits  the  message,  and 

while  it  is  waiting  for  the  second  acknowledgement  the  ACK  for 
the  first  transmission  comes  back.  At  this  time  the  copy  of 
the  message  should  be  dropped  and  a  new  message  can  be  accepted. 

II.  The  ACK  for  a  second  retransmission  may  arrive  before  the  ACK 
for  the  first  one.  Again,  at  this  time  the  TC  can  accept  a 
new  message  to  the  network. 

In  fact,  reception  of  an  ACK  for  a  message  signifies  its  correct  deliv¬ 
ery  to  the  destination  node.  To  simplify  the  analysis  we  require  that 
the  ACK  for  the  last  (re) transmitted  message  should  be  received  so  that 
the  TC  can  accept  a  new  message  to  the  network.  This  means,  once  an 
ACK  is  not  received  within  the  timeout  period,  we  assume  the  transmis¬ 
sion  has  failed  and  any  ACK  received  after  the  timeout  period  is  ignored 
As  a  consequence  of  this  assumption,  in  the  first  case  given  above  the 
ACK  is  ignored,  but  in  the  second  case  it  is  considered  as  an  indication 
of  the  correct  delivery  of  the  message.  The  reason  for  this  simplifying 
assumption  is  that  modeling  of  the  exact  operation  is  extremely  diffi¬ 
cult  and  the  analysis  is  unnecessarily  complicated.  One  way  of  modeling 
the  exact,  operation  is  to  tag  messages  with  a  class  indicator.  Newly 
arrived  messages  are  tagged  as  Class  1,  and  after  each  time  out  the 
class  indicator  is  increased  by  1.  The  resulting  model  can  be  analyzed 
as  a  network  of  queues  with  different  classes  of  customers  [BASK  75); 


however,  the  analysis  is  fairly  complicated  and  for  this  reason  we 
accept  the  simplified  assumption  stated  above  in  this  chapter. 

4.4.2  The  (Sub)network  Delay  Distribution 

A  detailed  specification  of  the  structure  of  the  (sub)network 
is  not  the  object  of  our  study;  we  view  the  (sub)network  as  a  source  of 
a  random  delay  which  we  choose  to  represent  by  a  proper  distribution 
function.  In  [WONG  77]  it  is  shown  that  the  end-to-end  delay  in  a 
Markovian  network  of  queues  is  the  sum  of  independent  and  exponentially 
distributed  random  variables.  Simulation  and  measurement  studies  of 
existing  networks  [COLE  71],  [KLEI  74B] ,  [FORG  7S] ,  [KLEI  76B]  show  that 
a  multistage  Erlangian  distribution  is  a  good  approximation  for  the 
round-trip  delay  distribution.  Ideally,  an  Erlangian  distribution  of 
degree  larger  than  10  is  the  best  choice;  however,  any  degree  larger 
than  2  makes  the  derivations  and  the  integrations  extremely  complicated. 
Considering  the  fact  that  the  nrotocol  performance  is  relatively  insen¬ 
sitive  to  the  degree  of  the  distribution  ([SUNS  75],  also  our  analysis 
with  degrees  =  1  and  2),  we  choose  to  use  an  Erlangian  distribution  of 
the  second  degree  (concatenation  of  two  exponential  distributions)  for 
the  round-trip  delay.  We  further  assume  that  one  one-way  delay  is  simi¬ 
lar  to  the  system  delay  of  a  simple  M/M/1  queueing  system.  Fig.  4.7-b 
shows  the  structure  of  the  (sub)network  used  in  our  model.  Because 
messages  which  contain  errors,  or  which  cannot  be  accepted  at  the  desti¬ 
nation  buffer,  are  not  acknowledged,  the  acknowledgement  delay  may  be 
infinite.  We  present  this  fact  in  our  model  by  including  an  impulse 
(of  size  P^,  the  probability  of  loss)  in  the  ACK  delay  distribution  at 
time  equal  to  infinity.  Fig.  4.8  shows  a  sketch  of  the  density  and  the 


distribution  function  of  acknowledgement  delay. 


4.4.3  The  Destination  Node  Structure 

In  Section  4.4.1  we  assumed  that  the  TC  must  receive  an  ACK  for 
the  last  (re)transmitted  message.  This  implies  that  the  destination 
node  may  receive  multiple  copies  of  the  same  message.  Clearly,  only 
the  first  received  copy  should  be  delivered  to  the  receiving  user  and 
the  later  copies  must  be  discarded.  Fig.  4.7-c  shows  the  structure  of 
the  destination  node.  An  arriving  message  is  first  checked  for  duplica¬ 
tion  and  if  it  is  the  first  copy,  it  is  passed  to  the  buffer  storage. 

If,  however,  the  message  is  found  to  be  a  duplicate  of  a  previously 
accepted  one,  it  is  discarded,  but  an  ACK  for  it  is  sent  back  to  the 
sending  TC.  If  there  is  space  available  at  the  buffer,  the  message  is 
admitted  and  is  put  in  the  queue  to  be  delivered  to  the  receiving  user 
and  an  ACK  is  sent  back  to  the  TC.  If,  however,  there  is  no  buffer  to 
store  the  message,  it  is  rejected  and  no  ACK  is  sent  back.  As  we  note, 
a  message  is  not  acknowledged  only  if  the  buffer  storage  is  full.  This 
means  we  are  implicitly  assuming  that  all  messages  are  received  error 
free.  The  fact  that  the  channel  error  rate  is  usually  very  small  (on 
the  order  of  10~5)  justifies  our  assumption. 

4.4.4  Analysis 

We  summarize  the  assumptions  stated  above  as  follows: 
Assumptions  4.2 

1.  Poisson  arrival  of  the  messages  to  the  (sub)  network. 

2.  Exponential  message  length  distribution  (with  mean  1/y) 

3.  ACK  for  the  last  (re) transmitted  message  must  be  received. 


4.  Round-trip  delay  has  an  Erlangian  distribution  of  degree  two 
(summation  of  two  exponential  distributions)  consisting  of  concate¬ 
nation  of  two  system  delay  distributions  of  an  M/M/1  queue: 


-(Yi-M*  -(Yi-Ae)t 

F?  (t)  =  1  -  e  e  -  (Yj-yte 

where  is  a  random  variable  representing  the  round-trip  delay, 
A@  is  the  (sub)network  traffic  rate  and  Yj  =  yC^,  Cj  being  the 
network  channel  capacity. 


5.  Destination  node  has  a  finite  number  of  buffers. 


6.  At  the  destination  node  only  the  first  successful  copy  of  a 
message  is  accepted;  however,  ACKs  for  later  copies  are  sent  back. 

7.  Negligible  channel  error  rate. 

8.  No  message  sequencing  (see  Section  4.1) 


We  start  our  analysis  by  deriving  the  ACK  delay  distribution. 
Based  on  Assumption  4.2-4  the  ACK  delay  distribution  for  successful 
transmission  is 

Ff  (t  |  no  loss)  =  1  -  -  (y  -  A  Jte'  (Yl"Ae)1; 

t  1C 

a 

where  y.  =  pC  and  C.  is  the  network  channel  capacity,  A  is  the  effec- 
111  c 

tive  network  traffic  (which  includes  retransmissions)  and  t  is  a  random 

a 

variable  representing  the  acknowledgement  delay.  We  also  have 

F~  (t  |  loss)  =  0 
a 

Therefore,  the  distribution  of  the  acknowledgement  delay  becomes 
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Fl  (t)  *  (1- P4)[l- e'(Yl'Ae)t  -  CYl-Vte'(Yl'Ae)t]  (4'9) 


and  its  density  function  will  be 

2  -(Y.-X  )t 

fl  ct)  .  d-pt)(Yi-  V  te  1  6  (4‘10) 

a 

where  P^  is  the  probability  that  an  ACK  is  not  received. 

Next  we  calculate  the  retranmission  probability.  Retransmis¬ 
sion  is  necessary  when  either  the  message  is  lost  at  the  destination 
buffer,  or  the  transmission  is  successful  but  the  acknowledgement  delay 
is  greater  than  the  timeout. 

Pf  =  Pr  [retransmission] 

=  P^  +  (1  -  Pfc)  Pr[ta>T  |  no  loss] 


pr  =  P£  ♦  0  -  PgHl  ♦  (Yl-  Xe)]e"(Yl‘Ae)T  (4.11) 


Because  of  retransmissions,  the  effective  network  traffic  is 
larger  than  X,  the  input  rate  of  messages  to  the  network.  They  are 
related  to  each  other  as  follows: 


(4.12) 


We  now  consider  the  length  of  time  a  message  occupies  a  buffer 
in  the  TC.  We  use  a  renewal  theory  argument  to  find  this  quantity. 


Let 


T1  ■ 


Then 


Tqc  =  E[time  of  occupancy] 

»  T.(l  -  P  )  ♦  (t  +  T  )P 
lv  rJ  K  oc'  r 


Solving  this  last  equation  for  Toc>  we  get 


T  =  T,  + 


oc  1  1  -  P 


Calculation  of  is  fairly  easy.  By  definition,  we  have 


Ti  *  ElU?„£T!  ■  /*  dF?  |l  Sr(t) 

a  a 


where 


h  it  ■  Prttait|t  <t] 

a'  a 


but 


Pr[ta<  t] 
Pr  [t  <  t] 


t  <  T 

irrlLo2lJ  “ 

0  t  >  T 

Using  the  above  relationships  and  after  some  algebra  we  get 


(Y 


.  ,  2  -<VXe>T 

1  '  Ae)T  e 


Y,  -  *  - (Yi -Alt 

1  e  1  -  [1+  (Yl- Ae)t]e  1  e 


and  finally  we  find  the  value  of  T  as  follows: 

oc 


P£  +  d-P£)e 


-(YrVT 


OC  Y1  ‘  Ae  ,  -(Yi-OT 

(1-P£){1-  [1  ♦  (Yl  -  Ae)x]e  1  e  } 


(4.13) 
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Summing  the  relationships  we  have  derived  so  far,  we  have 

-At 


Pr  =  P£  ♦  (1  -  PjHl  +  Ax)e 


(4.11) 


e  1  -  P. 


A  = 


T  *  t  * 
oc  A 


Cl  -  Pt)[l  -  (1*  Al)e"AT) 

Pt  *  »  -  P«)e‘At 

- I7TT  T 

(1  +  P^Hl  -  (1  ♦  Ax)e  AT] 


(4.12) 


(4.13) 


where  A  =  y.  -  A 
I  © 


Because  a  maximum  of  w  (the  window  size)  messages  are  accepted 


to  the  network  every  T  seconds,  the  maximum  input  rate  will  be 
'  oc 


A*  = 


(4.14) 


oc 


When  the  input  rate  is  A  •  A*,  from  Eq.  (4.12)  we  have 


X  -  * _ 

e  1  -  P  T 

r  oc 


(4.15) 


We  can  use  the  values  of  P  and  T  from  Eqs.  (4.11)  and  (4.13)  in 

r  oc 

Eq.  (4.15)  to  get 


w  A 


e  2(1  -  P£)  [  1  -  (1  +  Ax)e"AT]  +  [Pfc  +  (1  -  P£)e~AT]AT 


or  in  general 


Afi  =  Gjfx,  w,  Yj,  P^,  Ag) 


(4.16) 


The  unknowns  in  the  above  equation  are  Ag  and  P^,  the  proba¬ 
bility  that  an  ACK  is  not  returned.  Because  the  channels  are  assumed 
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to  be  noiseless,  P n  is  the  probability  that  a  message  is  rejected  from 


the  destination  buffer.  It  follows  from  our  previous  assumptions  that 


the  input  process  of  messages  to  the  destination  node  is  Poisson.  How 


ever,  because  the  duplicate  messages  are  not  stored  in  the  buffer  (but 


are  acknowledged,  see  Fig.  4.7-c),  the  counting  process  of  message 


arrivals  to  the  buffer  storage  is  no  longer  Poisson.  In  order  to  sim 


plify  the  derivation  of  the  rejection  probability  we  make  one  more 


assumption  regarding  this  process 


Assumption  4.2  (continued) 


9.  The  counting  process  of  message  arrivals  to  the  destination 


buffer  is  Poisson 


By  virtue  of  Assumption  4.2-2,  the  destination  buffer  behaves 


like  an  M/M/l/B  queueing  system  where  B  is  the  buffer  size  at  the  desti 


nation  node.  To  calculate  the  loss  probability  P«,  we  notice  that  at 


equilibrium  the  input  rate  of  messages  to  the  network  (X)  is  equal  to 


the  output  rate  of  messages  from  it.  Hence  the  total  arrival  rate  of 


messages  to  the  buffer  (the  accepted  messages  plus  the  rejected  ones) 


(4.17) 


where  and  is  the  capacity  of  the  output  channel. 

The  above  equation  may  be  written  as 

pz  =  g2(y2*  b*  pr  V  (4-18) 

The  analytic  results  presented  so  far  depend  upon  the  solution 
of  the  two  equations,  (4.16)  and  (4.18).  For  a  given  value  of  the 
parameters  (t,  w,  y^,  y^,  and  B)  the  above  system  of  equations  can  be 
solved  through  an  iterative  procedure  [HILD  66]  and  [ORTE  70]. 

Having  found  the  network  traffic  Xg,  we  can  use  Eq.  (4.12)  to 
find  the  maximum  input  rate  (or  the  maximum  throughput)  of  the  network. 
The  total  end-to-end  delay  consists  of  two  components:  the  network 
delay  and  the  destination  buffer  delay.  In  calculating  the  network 
delay  we  should  notice  that  only  the  delay  of  the  first  successful  mes¬ 
sage  is  of  importance  to  us;  further  retransmissions  only  effect  the 
network  traffic  (hence  have  an  indirect  effect  on  the  delay) .  The  aver¬ 
age  network  delay  can  be  found  through  a  renewal  theory  argument  [COX  62] 
as  follows: 

T  ■  T-br  (1  -  V  *  <T*T>pll 

In  the  above  equation  1/ (y ^  -  Ae)  is  the  one-way  network  delay  (from  the 
TC  to  the  destination  node)  if  no  retransmission  (due  to  rejection)  is 
required.  If  a  message  is  rejected,  after  t  seconds  it  is  retransmitted 
(the  transmission  process  renews).  The  second  term  corresponds  to  the 
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delay  when  retransmission  (due  to  rejection)  is  necessary.  Solving  the 
above  equation  for  T,  we  get 


T  = 


1 


n  -  Xe  1  -  pt 


(4.19) 


The  other  component  of  the  total  delay  is  the  destination  buffer  delay. 
This  is  equivalent  to  the  system  delav  of  the  corresponding  M/M/l/B 
queue  [KLEI  75] 


i/yc2  1  -  (1  +  B)pj  +  Bp®*1 


dst  1  -  p  , 


B 


(4.20) 


where  p,  =  A ,/pC~  is  the  utilization  factor  of  the  destination  buffer 
adz 

and  C 2  is  the  output  channel  capacity. 

Finally  we  have 


where  T 

ee 


T  =  T  +  T , 
ee  dst 


is  the  end-to-end  delay. 


(4.21) 


The  analysis  of  this  section  has  been  based  on  the  equilibrium 
of  the  system  for  a  selected  set  of  parameters  (x,  w,  p,  C^,  C 2>  B) 
which  are  not  dynamically  adjusted  to  the  stochastic  fluctuations  in  the 
load  of  the  system.  We  will  refer  to  the  mechanism  analyzed  above  as 
"static  flow  control." 


4.4.5  Numerical  Results 

In  this  section  we  present  numerical  examples  for  static  flow 
control  and  study  the  effect  of  different  parameters  (w,  x  and  B)  on  the 
throughput-delay  performance  of  a  network.  For  a  given  set  of  parameters 
we  can  find  the  maximum  throughput  (A*)  and  the  corresponding  (maximum) 


total  delay  (T*  ).  Whenever  there  is  no  ambiguity,  we  will  refer  to 
66 

these  quantities  simply  by  throughput  and  delay.  Throughout  our  presen¬ 
tation  we  consider  the  normalized  values  of  certain  quantities;  the 
normalization  constant  is  =  l/(yCj),  the  average  transmission  time  of 
a  message  on  a  network  channel.  It  also  turns  out  that  only  the  ratio 

Cj/C^  is  of  importance  to  us,  which  we  will  use  in  this  section. 

We  start  by  studying  the  effect  of  the  timeout  period  on  the 
network  throughput.  Fig.  4.9  shows  the  normalized  throughput  (X*/yCj) 
as  a  function  of  the  normalized  timeout  (t/X^)  for  different  window 
sizes  when  the  destination  buffer  size  (B)  is  10  and  C^/C^=  1.  It  can 
be  seen  that  when  the  timeout  is  very  small  the  throughput  decreases 
to  zero.  The  reason  for  this  degradation  is  as  follows:  when  the  time¬ 
out  is  small,  the  TC  retransmits  messages  very  fast,  hence  the  (sub) net¬ 
work  channel  utilization  (X  /pC.)  approaches  1  and  the  network  delay 
becomes  unbounded;  because  the  number  of  unacknowledged  messages  is 
limited,  no  new  message  is  admitted  to  the  network  and  the  throughout 

degrades  to  zero  (Eq.  4.14).  It  is  interesting  to  note  that  in  order 

for  the  system  to  remain  stable,  the  normalized  timeout  should  be  larger 
than  the  window  size.  This  fact  is  intuitively  clear  in  the  determinis¬ 
tic  case;  because  the  transmission  of  w  (w  is  the  window  size)  messages 
takes  wXj  seconds  (X^  being  the  transmission  time  of  one  message),  a 
timeout  of  x  <  wX^  causes  retransmission  of  a  message  to  start  before 
transmission  of  the  w  messages  can  possibly  be  completed;  as  a  result, 
the  network  becomes  saturated.  We  can  establish  this  fact  analytically. 
For  stability  we  require 

X  <  y, 
e  '  1 
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NORMALIZED  TIMEOUT  (t/x,) 

-  Dependency  of  Network  Throughput  on  Timeout 


Using  the  value  of  Xg  from  Eq.  (4.15),  the  above  inequality  becomes 

1  w 

1  -  P  T  Y1 
r  oc 


-7-  <  (1  -  P  )T 

Yj  rJ  oc 

Using  the  values  of  Pf  and  Tqc  from  Eqs.  (4.11)  and  (4.13)  we  get 


w  -At  (2  ^  / 

~  <  (1  -  P£)  [1  "  Cl  -  ADe  AT]  f  ♦  - - 5 - 

Y1  (  (i  -  P£) [i  -  (1  +  Ax)e  AT]  ) 


where  A  =  y,  -  X 
'  1  e 


As  approaches  y^ ,  A  approaches  zero  and  we  can  approximate 
-At 

e  by  the  first  term  of  its  expansion 


e”  -  1  -  At  as  A  4-  0 


-At 

Using  this  value  of  e  in  the  above  inequality,  and  after  some  algebra 
we  get 


~  <  [(1-  P.)(1*At)  ♦  PJt 


However,  because  A  =  0,  this  inequality  reduces  to 


(4.22) 


which  is  what  we  were  looking  for. 

Continuing  our  discussion.  Fig.  4.9  also  shows  that  when  the 
timeout  increases,  the  network  throughput  first  grows;  however,  after 
a  certain  value  of  t,  the  throughput  starts  to  decrease.  In  fact,  when 
the  destination  buffer  size  (B)  is  finite,  as  rt”  the  throughput 


/ 


degrades  to  zero,  i.e., 


lim  X*  =  0 

T+°° 

This  fact  is  easily  shown  by  using  Lqs.  (4.i4)  and  (4.15) 

lim  A*  =  lim  =r—  =  lim  — - - - -r -  =  0 

xfoo  Xfoo  oc  xt«>  2  +  (1  -  P^e'  T 

1  (1  -  P£)[l  -  (1  +  Ax)e'AT]  T 

Fig.  4.9  shows  that  this  throughput  degradation  is  more  severe  for 
larger  window  sizes.  The  reason  for  this  degradation  is  that  for  a 
large  timeout  the  TC  waits  for  a  long  time  for  an  ACK  to  come  hack;  how¬ 
ever,  because  of  the  finite  destination  buffer  size,  there  is  a  non-zero 
probability  that  an  ACK  never  comes  back,  hence  the  buffer  of  the  traffic 
controller  becomes  full  of  unacknowledged  messages,  and  no  further  exter¬ 
nal  traffic  is  accepted.  We  should  point  out  that  this  degradation  is 
not  present  if  the  destination  buffer  size  is  infinite;  the  throughput 
curve  in  Fig.  4.9  for  w=  10  and  the  destination  buffer  size  of  infinity 
explicitly  shows  this  fact.  Fig.  4.9  shows  that  for  a  fixed  window  size, 
there  is  a  value  of  timeout  that  optimizes  (maximizes)  the  throughput. 

We  will  designate  this  optimal  value  by  x*  (as  opposed  to  x^,  which  is 
the  optimal  value  of  timeout  with  respect  to  the  end-to-end  delay;  see 
below).  This  figure  shows  that  when  the  window  size  is  small,  the 
throughput  curve,  after  an  initial  increase,  remains  quite  flat,  and 
only  for  large  window  sizes  is  the  maximal  throughput  sensitive  to 
timeout. 

The  effect  of  the  timecut  period  on  delay  is  shown  in  Figs. 
4.10-a,  4.10-b  and  4.10-c.  Fig.  4.10-a  shows  the  normalized  (maximum) 
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10.0  100.0  1000.0 
NORMALIZED  TIMEOUT  (r  /x.,) 

Figure  4.10-a  End-To-End  Delay  a*  a  Function  of  Timeout 
(B  ■  10;  C^/C2  “  1-0). 


•10-b  Network  Delay  at  a  Function  of  Til 


end-to-end  delay  (T^/Xj)  as  a  function  of  the  normalized  timeout  for 
different  window  sizes  (end-to-end  delay,  as  defined  in  Section  4.4.4, 
is  the  sum  of  the  network  delay  and  the  destination  node  delay).  As  in 
Fig.  4.9,  this  figure  also  shows  that  for  a  fixed  window  size,  there  is 
a  value  of  timeout  which  optimizes  (minimizes)  the  end-to-end  delay; 
this  optimal  timeout  (x£,  d  for  delay)  is  usually  different  from  x*. 

When  the  timeout  is  too  small,  like  the  throughput,  the  end-to-end  delay 
becomes  unbounded  (the  minimum  allowable  timeout  is,  of  course,  wX^);  as 
the  value  of  the  timeout  increases,  the  end-to-end  delay  first  reaches 
a  minimum  and  then  becomes  unbounded.  As  Figs.  4.10-b  and  4.10-c  show, 
this  unboundedness  is  due  to  the  network  delay. 

We  now  study  the  effect  of  the  window  size  on  the  network  per¬ 
formance.  In  Fig.  4.11  the  normalized  throughput  is  shown  as  a  function 
of  the  window  size.  On  the  curve  designated  by  "optimal  throughput," 
for  each  window  size  the  optimal  value  of  timeout  (x* )  is  chosen  such 
that  the  throughput  becomes  maximal;  whereas  on  the  curve  designated  by 
"optimal  delay,"  the  optimal  value  of  timeout  (x*)  is  chosen  so  that  the 
end-to-end  delay  becomes  minimal.  This  figure  also  shows  the  throughput 
curves  for  several  constant  (normalized)  timeouts.  For  a  fixed  timeout, 
the  network  throughput  initially  increases  as  w  increases;  however  at 
a  certain  value  of  window  size  the  throughput  curve  reaches  its  maximum 
and  then  falls  down;  finally  as  w  becomes  greater  than  x/Xj  the  curve 
goes  to  zero,  which  agrees  with  our  previous  result  (Eq.  4.22).  The 
throughput  curves  for  constant  timeout  display  the  known  behavior  of  a 
contention  system  (described  in  the  opening  of  this  chapter);  here  the 
window  size  represents  the  load  on  the  system.  Note  that  the  optimal 


NORMALIZED  MAXIMUM  THROUGHPUT 


WINDOW  SIZE 

Figure  4.11  -  Network  Throughput  w.  Window  Sin 
(B  ■  10;  tyC2«  1.0). 
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throughput  curve  is  the  maximum  envelope  of  these  contours,  and  defines 
the  tight  upper  bound  on  the  throughput -window  size  performance. 

The  end-to-end  delay  as  a  function  of  window  size  is  shown  in 
Fig.  4.12,  which  is  the  counterpart  of  Fig.  4.11.  In  this  figure,  the 
contours  of  constant  (normalized)  timeout  peel  off  from  the  delay  curve 
of  the  optimal  throughput,  become  tangent  to  the  optimal  delay  curve  and 
finally  become  unbounded  at  a  value  of  the  window  size  equal  to  x/X^. 

In  analogy  to  Fig.  4.11,  the  optimal  delay  curve  in  Fie.  4.12  is  the 
minimal  envelope  of  the  contours  of  constant  t's,  hence  it  is  the  tight 
lower  bound  on  the  delay-window  size  performance. 

To  show  the  effect  of  the  contention  more  explicitly,  we  have 
plotted  the  network  throughput  versus  the  network  traffic  for  (constant) 
normalized  timeouts  in  Fig.  4.13.  For  a  fixed  t,  an  increase  in  the  win¬ 
dow  size  results  in  an  increase  in  the  network  traffic.  Initially  the 
throughput  increases  as  the  network  traffic  grows;  however,  after  a  cer¬ 
tain  value  of  Xe>  the  throughput  starts  declining,  and  when  the  network 
saturates  (Xg/pC1  =1)  the  throughput  decreases  to  zero.  In  the  same  fig¬ 
ure  we  have  also  shown  the  contours  of  constant  window  size.  Note  that 
for  a  fixed  window  size,  when  the  timeout  increases,  throughput  first 
grows  and  then  levels  off;  however  the  network  traffic  decreases  contin¬ 
uously.  Note  also  that  all  w-contours  peel  off  from  a  common  upper  bound 
curve,  which  is  the  maximal  throughput  curve  (Fig.  4.11).  Note  how  the 
window  size  and  timeout  vary  along  this  curve;  in  fact,  if  any  one  of 
these  two  parameters  (w  and/or  x)  is  held  fixed  and  the  other  is  increased, 
the  throughput  decreases  to  zero;  only  when  both  w  and  x  are  increased 
simultaneously  (such  that  for  each  w  the  optimal  x*  is  chosen)  will  the 
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throughput  not  degrade  (see  the  maximal  throughput  curve  in  Fig.  4.13). 
This  figure  shows  that  the  (normalized)  throughput  never  reaches  1;  even 
when  w  and  x  are  chosen  optimally,  the  throughput  is  limited  by  the 
destination  node  buffer  size.  The  price  for  having  more  throughput  is, 
of  course,  the  delay  which  is  shown  in  Fig.  4.14.  This  figure  shows 
the  throughput-delay  equilibrium  for  fixed  values  of  (normalized)  time¬ 
outs  and  window  sizes.  The  minimum  envelope  of  these  contours  defines 
a  tight  lower  bound  on  the  throughput-delay  performance  and  thu$  can  be 
veiwed  as  the  optimal  network  performance  for  the  window  mechanism  model. 
As  we  pointed  out  earlier,  the  (normalized)  throughput  never  reaches  1; 
the  maximum  throughput  (which  is  achieved  when  w  t  <*>  and  x  +  °°  such  that 
x  is  optimal)  is  determined  by  the  destination  buffer  size  and  the  desti¬ 
nation  channel  capacity.  For  example,  in  Fig.  4.14  this  maximum  is 
achieved  when  the  network  traffic  approaches  its  maximum  (Ae=pCj, 
being  the  network  channel  capacity) ,  and  for  this  network  traffic  the 
(normalized)  throughput  is  approximately  0.909  (and  the  end-to-end  delay 
is,  of  course,  unbounded).  In  Fig.  4.15  we  have  shown  the  optimal 
throughput-delay  curves  for  different  destination  buffer  sizes  and  a 
fixed  destination  channel  capacity  (Cj/C^l).  Along  each  curve  the 
timeout  and  the  window  size  vary,  and  we  have  also  shown  the  contours 
of  constant  window  size.  This  figure  shows  that  for  a  fixed  end-to-end 
delay  the  network  throughput  increases  rapidly  as  the  destination  buffer 
size  increases.  (Note  that  for  B =  20  the  maximum  throughput  is  almost 
achieved.)  When  the  channel  capacities  (network  channels  as  well  as  the 
destination  node  channel)  are  known,  this  figure  can  be  used  as  a  design 
tool  to  determine  a  proper  destination  buffer  size.  For  example,  if  we 
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set  a  minimum  level  of  throughput,  say  A/pC^  >  0.7,  and  a  maximum  end- 
to-end  delay,  say  T  /Xj  <  10,  then  this  figure  shows  that  the  destination 
buffer  size  should  be  at  least  5  (the  point  specified  by  t lie  intersection 
of  the  light  dashed  lines  in  Fig.  4.15).  The  operating  rule  is  to  use 
a  window  size  of  5,  and  the  optimal  timeout  should  be  determined  using 
a  curve  like  the  one  in  Fig.  4.9.  Note  that  a  destination  buffer  size 
of  B > 5  should  also  satisfy  our  requirements:  however,  we  have  chosen 
the  minimum  buffer  size  to  minimize  the  overall  network  cost. 

When  the  destination  buffer  size  and  the  capacities  of  channels 
are  known,  then  it  is  important  to  know  what  (w,t)  combination  to  use 
in  order  to  provide  a  certain  grade  of  service.  In  Fig.  4.16  we  have 
plotted  design  contours  of  constant  (normalized)  throughput  (the  heavy 
solid  curves),  and  (normalized)  end-to-end  delay  (the  heavy  dashed 
curves);  we  will  refer  to  these  curves  as  A*-contours  and  r*-contours 
respectively.  This  figure  displays  most  of  the  information  contained  in 
the  previous  figures;  in  particular  we  observe  the  following  character¬ 
istics  : 

1.  The  loci  of  minimum  (normalized)  timeouts  is  the  lowest 
A*-contour  (which  coincides  with  the  highest  T*-contour)  designated  by 
"w=t/X^"  in  Fig.  4.16.  Along  this  curve  the  throughput  is  zero  and  the 
delay  is  unbounded  (infinite). 

2.  A  line  of  constant  window  size  (a  line  perpendicular  to 
the  window  size  axis)  may  intersect  a  specified  A*-contour  (say  A*  =  A*) 
at  (a)  two  points;  (b)  one  point  (tangent  to  the  A*  (  =  AJ)-contour); 
and  (c)  no  point.  In  case  (a)  the  two  values  of  timeout  give  the  same 
throughput  (but  the  delays  at  these  two  points  are  different  from  each 
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Figure  4.16  -  Contour*  of  Constant  Throughputs  and  End-To-End  Delays  (B  -  10;  C1/C2  ■  1.0). 
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other).  In  case  (b)  the  value  of  timeout  at  the  tangent  point  is  x*, 
the  optimal  timeout  for  maximal  throughput  (compare  with  Fig.  4.9).  We 
have  indicated  the  loci  of  these  points  by  curve  A  in  this  figure  (which 
we  will  refer  to  as  the  x*  -  curve).  If  the  line  of  constant  window  size 
does  not  intersect  the  A* (=  A*) -contour  (case  (c)),  that  indicates  that 
a  throughput  of  A*  cannot  be  realized  by  this  window  size. 

3.  The  above  discussion  can  be  carried  out  for  the  T*-contours; 
the  loci  of  points  where  the  constant  w  lines  are  tangent  to  T* -contours 
(the  x*-curve)  is  designated  by  curve  B. 

4.  We  can  use  Fig.  4.16  to  find  the  maximum  achievable  through¬ 
put  (and  the  corresponding  window  size)  for  a  fixed  timeout.  The  A* - 
contour  which  is  tangent  to  the  constant  x  line  (a  line  perpendicular  to 
the  x  axis  with  an  intercept  equal  to  the  specified  timeout)  determines 
the  maximum  throughput,  and  the  value  of  the  window  size  at  the  tangent 
point  is  the  w  which  provides  this  throughput.  It  is  interesting  to 
note  that  this  value  of  the  throughput  is  the  point  where  the  x-constant 
line  intersects  the  envelope  of  the  throughput  curves  in  Fig.  4.9  (we 
have  not  drawn  this  curve) . 

5.  Fig.  4.16  can  be  used  to  find  the  best  (w,x)  combination 
which  guarantees  a  certain  grade  of  service.  Assume  that  for  a  given 
network  (for  which  the  parameters  y,  C^,  and  B  are  known)  we  are 
required  to  provide  at  least  a  minimum  throughput  (say  A*/yC^ _>  0. 85) . 

In  order  to  find  the  (w,x)  which  results  in  the  minimum  end-to-end  delay, 
we  trace  the  A*( =  0. 85) -countour  and  find  the  T*-contour  that  is  tangent 
to  it.  (In  Fig.  4.16  the  T-contour  of  18.5  is  such  a  contour,  which  is 
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tangent  to  the  X* (=  0.85) -contour  at  w=21  and  r/X^  =  73.)  Any  (w,t) 
other  than  (2J.73)  results  in  a  higher  delay,  hence  this  pair  is  the 
best  operating  point.  The  loci  of  these  tangent  points  (or  the  best 
(w,t)  combinations)  is  shown  in  Fig.  4.16  as  curve  C.  Note  that  we  can 
use  a  smaller  w  and  still  have  the  same  throughput  (e.g.,  intersection 
of  X*( =  0.85)-contour  and  curve  A);  however,  the  delay  at  this  point  is 
larger  than  18.5.  Considering  the  fact  that  the  window  size  represents 
the  buffer  size  of  the  traffic  controller,  this  latter  operating  point 
(with  a  smaller  window  size)  may  be  a  better  choice,  as  it  results  in  a 
lower  network  (buffer)  cost.  Fig.  4.16  can  be  used  in  an  analogous  way 
in  situations  where  the  end-to-end  delay  is  required  to  be  no  more  than 
a  given  maximum  value.  Similar  to  the  previous  case,  the  best  operating 
(w,t)  value  (which  results  in  a  maximum  throughput)  is  found  by  locating 
the  intersection  of  the  T*-contour  and  curve  C. 

This  concludes  our  analysis  of  static  flow  control.  In 
Chapter  5  we  will  introduce  a  dynamic  control  which  uses  the  properties 
of  this  static  control  and  which  is  capable  of  adjusting  the  window  size 
according  to  the  stochastic  fluctuations  of  the  load  in  the  destination 
buffer. 


4.5  Conclusion 

Based  on  a  window  mechanism,  we  developed  three  models  for 
static  flow  control  in  store-and-forward  computer  communication  networks. 
In  all  of  the  models  the  assumption  was  that  the  sending  user  is  fast 
compared  to  the  network  and  always  has  a  message  to  transmit;  therefore, 
throughput  is  mainly  determined  by  the  network.  Throughout  our  study 
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we  were  concerned  with  the  "maximum"  throughput-delay  behavior  of  a  net¬ 
work. 

We  first  studied  a  fluid  approximation  model  in  which  there  is 
no  stochastic  fluctuation  in  the  load  on  the  system.  This  model, 
although  simple,  shows  that  the  traffic  and  throughput  of  a  network  can 
be  controlled  by  a  window  mechanism  and  that  it  is  not  beneficial  to 
increase  the  window  size  beyond  a  certain  value. 

We  then  developed  a  simple  stochastic  model  by  improving  the 
accuracy  of  the  deterministic  model  by  permitting  some  fluctuations  in 
the  arrival  process  and  message  lengths.  By  assuming  the  arrival  proc¬ 
ess  of  messages  to  the  network  to  be  Poisson,  and  message  lengths  to  be 

exponentially  distributed,  closed  form  expressions  for  maximum  through¬ 
put  and  delay  of  a  network  were  derived.  In  order  to  determine  the  best 
window  size  to  be  used,  we  used  power  (defined  as  the  ratio  of  through¬ 
put  and  delay  [GIES  76])  as  the  measure  of  performance;  for  this  meas¬ 
ure,  it  was  found  that  the  best  window  size  for  this  simple  stochastic 
model  is  the  same  as  the  one  determined  by  the  deterministic  model. 
Because  of  some  simplifying  assumptions  regarding  network  reliabili- 
ity,  this  model  does  not  reveal  the  contention  phenomenon  (where  the 
throughput  of  a  network  degrades  as  its  traffic  increases) ;  however,  it 
is  very  useful  in  that  it  provides  us  with  a  closed  form  solution  for 
the  throughput  and  delay.  In  Chapter  6  we  use  these  closed  form  solu¬ 
tions  to  develop  a  network  algebra  for  systematic  studies  of  intercon¬ 
nections  of  networks. 

Finally,  by  making  further  assumptions  regarding  the  acknowl¬ 
edgement  and  timeout  strategy,  the  (sub)network  delay  distribution  and 
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the  destination  node  structure,  we  developed  a  generalized  stochastic 
model  for  static  flow  control.  It  was  shown  that  for  a  fixed  window 
size,  if  the  timeout  period  is  either  too  large  or  too  small,  the 
throughput  degrades  to  zero.  In  fact,  for  a  given  window  size,  there 
is  a  value  for  timeout  which  optimizes  the  throughput.  This  model 
explicitly  shows  the  effect  of  contention  in  that,  when  the  timeout 
period  is  fixed,  as  the  window  size  (or  the  network  traffic)  increases, 
the  throughput  first  increases  and  then  degrades  to  zero.  By  studying 
the  throughput-delay  performance  of  networks  (when  traffic  is  con¬ 
trolled  by  window  mechanism),  we  found  the  optimal  network  performance 
curve;  the  minimum  envelope  of  the  throughput-delay  equilibrium  for  fixed 
values  of  timeouts  and  window  sizes.  A  number  of  design  problems  were 
proposed  and  the  optimal  solution  to  each  was  given  through  graphical 
presentation. 

In  the  next  chapter,  by  using  the  results  of  static  flow  con¬ 
trol  analysis,  we  develop  and  study  a  dynamic  flow  mechanism  in  which 
the  window  size  and  timeout  are  dynamically  adjusted  to  the  stochastic 
fluctuations  in  the  load  on  the  system. 


CHAPTER  5 


DYNAMIC  FLOW  CONTROL  IN 
STORE-AND- FORWARD  COMPUTER  NETWORKS 

In  static  flow  control  (studied  in  the  last  chapter) ,  parameters 
of  the  control  mechanism  are  not  dynamically  adjusted  to  the  network 
load.  Due  to  finite  buffer  size  at  the  destination  node,  messages  may 
be  rejected  at  this  node  and  have  to  be  retransmitted.  The  extra  net¬ 
work  traffic  due  to  retransmissions  results  in  a  high  round-trip  delay, 
therefore  causing  frequent  occurrences  of  timeout,  and  consequently  more 
retransmissions;  thus  we  see  the  potential  for  a  dangerous  positive  feed¬ 
back  situation.  We  notice  that  if  there  were  a  further  control  to  "slow 
down"  the  input  of  new  messages  to  the  network  by  shrinking  the  window 
size  when  the  destination  buffer  begins  to  fill,  then  fewer  messages 
would  be  rejected  (or  lost)  at  the  destination  node,  and  thus  fewer 
retransmissions  would  be  necessary.  This  is  the  idea  behind  the  dynamic 
flow  control  mechanism  which  we  develop  and  study  in  this  chapter. 

We  consider  a  mechanism  located  at  the  destination  node,  which 
supervises  the  buffer  occupancy  at  this  node;  let  us  call  this  mechanism 
a  traffic  director  (TD) .  At  the  disposal  of  the  TD  there  are  a  number 
of  different  window  sizes  from  which  it  can  choose.  When  the  occupancy 
of  the  buffer  is  high,  the  TD  signals  the  traffic  controller  (TC)  at  the 
source  node  (via  a  special  control  packet)  to  change  its  window  size  to 
a  smaller  one;  when  the  occupancy  is  low,  the  TD  notifies  the  TC  to 
enlarge  the  window  size.  This  notification  is  done  by  control  packets 


which  are  sent  from  the  TD  to  the  TC  of  the  source  node.  The  window 
size  used  by  the  TC  at  the  source  node  determines  the  input  rate  to  the 
network;  therefore,  the  above  mechanism  dynamically  controls  the  input 
flow  to  the  network  (the  assumption  is  that  the  sending  user  always  has 
a  message  to  transmit,  hence  the  throughput  is  determined  by  the  network 
constraints,  see  Section  4.1).  To  prevent  excessive  overhead,  control 
packets  are  piggybacked  (similar  to  acknowledgements)  on  messages  which 
are  being  transmitted  from  the  destination  node  to  the  source  node;  only 
if  there  is  no  such  message  is  a  control  packet  sent  by  itself. 

A  decision  table  is  stored  at  the  destination  node  which  speci¬ 
fies  the  best  window  sizes  to  be  used  for  different  states  of  the  network. 
The  TD,  by  knowing  the  state  of  the  network,  can  therefore  use  the  deci¬ 
sion  table  to  decide  upon  the  best  window  size  it  should  send  to  the  TC. 
The  questions  which  arise  at  this  point  are: 

How  is  a  network  state  specified? 

What  is  meant  by  the  best  window  size? 

How  is  the  decision  table  set  up? 

In  order  to  answer  these  questions  we  develop  a  mathematical 
model  of  the  system  based  on  Markov  decision  theory.  We  model  the  des¬ 
tination  node  as  a  finite  waiting  room  queueing  system  with  K  possible 
input  rates  (K  is  also  the  number  of  different  window  sizes;  see  below), 
for  which  we  want  to  find  an  optimal  policy  for  choosing  input  rates 
such  that  the  long  run  average  throughput  of  the  buffered  system  is  maxi¬ 
mal,  and  the  average  delay  is  minimal.  (As  we  will  see,  delay  is  not  due 
only  to  the  destination  node.)  After  defining  a  reward  criteria,  we 
derive  a  maximal  reward  rate  stationary  operating  policy  for  the  finite 
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queue;  therefore,  the  problem  we  pose  will  be  reduced  to  the  optimal 
control  of  a  finite  queue  with  variable  input  rates.  As  we  see  below, 
due  to  special  characteristics  of  the  system,  an  exact  solution  to  the 
mathematical  model  is  very  laborious;  therefore,  we  develop  a  heuristic 
solution  to  the  problem. 

In  Section  5.1  the  model  is  formulated  as  a  discrete-time, 
multiple-state  variable  Markov  decision  process,  which  we  will  refer  to 

A  A 

as  a  Tr-cycle-delay  decision  process  (T  is  the  round-trip  delay,  see 
Section  5.1.1).  A  heuristic  solution  to  this  Markov  process  is  presented 
in  Section  5.2.  In  Section  5.3  we  discuss  the  implementation  of  the 
heuristic  optimal  policy,  and  in  Section  5.4  we  further  elaborate  on  a 
1-cycle-delay  decision  process  which  we  use  to  generate  the  decision 
table.  Finally,  in  Section  5.5  we  present  some  numerical  results.  For 
completeness,  a  summary  of  results  from  Markov  decision  theory  and  a 
brief  description  of  the  policy- iteration  method  [HOWA  60]  are  presented 
in  Appendix  C.  The  material  in  this  appendix  is  taken  from  Howard 
[HOWA  60,  71],  Jewell  [JEWE  63]  and  Ross  [ROSS  70]. 


5.1  The  Model 

We  consider  the  destination  node  of  a  network  (Fig.  4.1)  as  a 
buffer  storage  of  size  B  in  front  of  a  transmission  line  with  capacity 
C 2  (bps) .  Messages  which  arrive  at  the  storage  (after  passing  through 
the  network)  are  buffered  (if  necessary)  and  are  then  transmitted  out  to 
the  receiving  user.  We  assume  that  the  message  lengths  are  exponentially 
distributed  and  let  1/y  be  the  average  message  length  (bits).  With 
these  assumptions,  the  transmission  times  of  messages  out  of  the  buffer 
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are  exponentially  distributed  with  a  rate  of  y =  yC2.  Messages,  after 
passing  through  the  network,  arrive  at  the  buffer.  For  the  arrival 
process  we  make  the  following  assumption: 

Assumption  5.1 

The  counting  process  of  message  arrival  to  the  buffer  is 
assumed  to  be  Poisson  with  an  arrival  rate  X  e  A  (msg/sec) ,  where 
A  =  {Xj,  X2,  ....  X^}  is  the  set  of  possible  arrival  rates. 

For  reasons  which  will  be  clear  shortly  we  study  a  discrete-time 
model  with  very  small  time  increments,  and  observe  the  system  at  minis- 
cule  time  intervals  6  (e.g.,  6  =  4  *  10  sec  =  4  msec).  The  discrete 
approximation  to  the  continuous  case  causes  some  inconvenience  when  we 
describe  the  state  transition  probabilities.  We  assume  that  at  most  one 
departure  and  one  arrival  may  occur  at  each  time  slice  (of  6  seconds). 
This  means,  at  the  beginning  of  each  time  interval  a  new  message  may 
arrive  in  the  buffer  with  probability  X6  (the  result  of  a  Bernoulli 
trial);  thus  the  average  number  of  arrivals  per  second  is  X.  The  trans¬ 
mission  times  of  messages  are  chosen  independently  from  a  geometric  dis¬ 
tribution  such  that,  for  y2  =  y2  6  <  1 

sn  =  y2(!  -  Y2)n_1  n=l,  2,  ...  (5.1) 

where  s^  is  the  probability  that  a  message  transmission  time  is  exactly 
n  time  units  long  (i.e.,  that  its  service  time  is  n  6  seconds).  The 
average  transmission  time  is  therefore 

a  1  6  1 

X-  =  —  time  units  =  —  sec  =  —  sec  (5.2) 

y2  12 
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Regarding  the  order  in  which  the  events  take  place  at  the  end  of  a  time 
slice,  we  assume  that  departures  take  place  before  arrivals.  This  means 
that  a  message  which  has  completed  its  transmission  leaves  the  system, 
and  then  a  message  which  has  arrived  is  allowed  into  the  buffer.  (If 
the  buffer  is  full  and  there  is  no  departure,  then  the  arriving  message 
is  lost  and  must  be  retransmitted  from  the  source.) 

Fop  notational  purposes  for  the  discrete  time  version  we  use 

j  /v  /v 

variables  upder  a  circumflex  ;  in  particular,  we  let  X^=Xj(6,  X  =  X6, 
and  ^2  =  y2^‘  round-trip  delay  is  designated  by  the  random  variable 

/v 

?r  with  an  average  and  we  let  Tf  =  Tr/<5.  For  the  one-way  delay,  the 
corresponding  notation  will  be  f,  T  and  T  =  T/6. 


5.1.1  The  State  Space 

The  decision  on  the  appropriate  window  size  by  the  traffic 
director  (TD) (hence  on  the  input  rate  to  the  buffer  storage;  see  below), 
should  be  based  basically  on  the  occupancy  of  the  buffer.  Having  decided 
on  the  new  window  size,  through  the  control  signalling  process  described 
earlier,  the  TD  notifies  the  traffic  controller  (TC)  of  this  decision. 
When  the  TC  receives  this  notification  (after  the  one-way  delay  of  i 
seconds) ,  it  adjusts  the  window  and  the  input  rate  to  the  network  is 
changed  corresponding  to  the  new  window  size.  We  assume  that  this 
change  of  input  rate  is  instantaneous.  The  diagram  in  Fig.  5.1  shows 
the  evolution  in  time  of  the  input  rate  to  the  network.  In  this  figure, 
the  solid  line  is  the  input  rate  we  use  in  our  analysis  and  the  dashed 

181 


n 

/ 

/ 

/ 

curve  represents  an  approximation  to  the  actual  input  rate  as  a  function 
of  time.  It  is  possible  to  use  the  dashed  curve  and  analyze  the  system 

I 

as  a  queue  with  time-varying  arrival  rates  [NEWE  68,  71];  however,  due 
to  the  complexity  of  the  resulting  model,  we  choose  to  use  the  input 
rate  with  instantaneous  changes. 

After  the  input  rate  is  changed,  messages  which  are  admitted 
with  the  new  rate  reach  the  destination  node  after  t  seconds.  There¬ 
fore,  there  is  a  (random)  gap  from  the  time  the  TD  decides  upon  a  new 
input  rate  until  the  result  of  this  input  rate  becomes  effective  at  the 
destination  node.  When  this  time  gap  is  random,  the  resulting  model 
becomes  extremely  difficult  to  analyze  and  in  fact,  no  such  analysis  has 
been  done  in  the  literature.  To  render  the  model  amenable  to  analysis, 
we  further  assume  that  the  time  gap  between  the  moment  that  a  decision 
is  made  until  the  time  it  becomes  effective  is  a  constant  interval  of 
duration  Tr  seconds,  the  average  round-trip  delay.  Note  that  due  to  the 
fact  that  the  input  rate  to  the  network  varies,  the  average  round-trip 
delay  fluctuates  as  well;  however,  we  will  take  Tf  as  the  average  of 
round-trip  delays  due  to  different  possible  choices  of  window  sizes  (or 
input  rates) . 

With  the  above  considerations,  we  note  that  the  decision  at 

time  t  becomes  effective  at  the  destination  node  at  time  t  +  T  .  Con- 

r 

sidering  the  fact  that  a  decision  on  window  size  is  basically  based  on 
the  buffer  occupancy,  the  decision  at  time  t  should  be  based  on  the 
occupancy  of  the  buffer  at  time  t  +  T  .  Thus  the  state  description  at 
time  t  should  contain  information,  not  only  regarding  the  buffer  occu¬ 
pancy  at  time  t,  but  also  the  input  rates  to  the  buffer  in  the  interval 
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(t,  t  ♦ T  )  .  (These  input  rates  have  been  decided  upon  in  the  interval 
(t-Tr»  t).)  In  order  to  include  this  information  in  the  state  variable, 
we  r.  nsider  a  discrete  time  model  in  which  the  system  is  observed  every 

A 

6  seconds,  and  when  a  decision  is  made  it  becomes  effective  after  Tr 
( =  Tr/6)  time  units.  In  the  discrete  time  model  we  designate  time  by 
t;  therefore  time  t  (in  seconds)  is  identical  to  t  (  =  t/6  )  measured  in 
a  time  unit  equal  to  6.  With  the  above  considerations,  the  state  of 
the  system  at  time  t  is  represented  by  X(t),  where 


X(t)  =  {n(t) ,  X(t.l).  X(t,2),  ....  X(t,Tr-l)}  (5.3) 


where  0  £  n(t)  <  B  is  the  buffer  occupancy  at  time  t  and  £(t,k)  (mea- 

A 

sured  in  msg/6sec)  is  the  input  rate  to  the  buffer  at  time  (t  +  k) , 

A  A  A 

l<^k<Tr>  Note  that  X(t,k)  is  the  result  of  the  decision  which  was  made 

A  A 

Tr  -  k  time  units  earlier  than  t.  We  may  also  label  the  states  by  the 
integers  0,  1,  ...,  N,  where  (N+l)  is  the  size  of  the  state  space;  in 
equilibrium  a  state  i  is  defined  as 


~1  ~2  *.Tr-l 

i  ={n.,  X*,  xf ,  ....  X.r  } 


0  <  i  <  N 


(5.4) 


The  set  of  all  states  in  the  state  space  will  be  denoted  by  S.  The 
cardinality  of  the  set  S,  |s| ,  is 

A 

T  -1 

|S|  =  N  +  1  =  (B+l)xK  r 


where  K  is  the  number  of  different  window  sizes. 

As  a  notational  convention  we  use  the  symbol  "«•-*>",  to  connect 

A 

a  state,  say  X(t)  to  its  label,  i.e., 


X(t)  <“>  i 


(5.5) 
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if 


n(t)  =  iu  (5.6-a) 

and 

A(t,k)  =  A*  1  <  k  <  Tr  (5.6-b) 

5.1.2  The  Action  Space 

We  consider  a  finite  set  of  window  sizes  W  =  (w^,  . ..,  w^} 

at  the  disposal  of  the  TD.  From  our  earlier  analysis  in  Chapter  4,  we 
know  that  for  a  given  network  and  for  a  window  size  w,  a  timeout  x*  can 
be  chosen  such  that  the  throughput  of  the  network  becomes  maximal. 

Hence,  when  the  TD  decides  upon  a  window  size,  it  also  determines  the 

appropriate  timeout  xetx^,  x^,  _ ,  x^}  (or  (x^,  x^,  _ ,  x^})  and 

notifies  the  1C  in  the  source  node  (in  the  manner  described  above).  The 
set  W  is  the  set  of  actions  in  our  model;  however,  because  each  (w|c»Tjc) 
determines  an  input  rate  (see  Chapter  4),  we  might  as  well  assume  that 
the  corresponding  set  of  input  rates  A  =  (A^,  ...,  A^J  (or  (A^,  A2> 

...,  A^},  where  A  =  6A  )  is  the  set  of  actions.  In  the  discussion  below 
we  will  refer  to  either  of  these  two  sets  (V  or  A)  as  the  action  space, 
i.e. , 

A  =  A  or  A  =  W  (5.7) 

Corresponding  to  the  above  sets  of  input  rates,  timeouts  and  window 
sizes,  there  is  a  set  of  network  delays  T  =  (T^,  T2,  ...,  T^}  or 
{Tj,  T2,  ...,  T^}  where  T^  is  the  network  delay  when  w^  is  being  used. 


~  i®  £S 
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5.1.3 


The  Policy  Space 

A  policy  specifies  a  window  size  (or  equivalently  an  input 
rate)  which  should  be  sent  to  the  TC.  More  precisely,  by  a  policy  f  we 
mean  a  decision  rule  that  says,  given  that  the  system  is  in  state  i, 

A 

then  we  should  use  X^  (or  w^)  as  the  input  rate  to  the  buffer  which  will 

A 

be  effective  Tr  time  units  later,  that  is, 

f(i)  =  k  (5.8) 

We  are  implicitly  using  policies  that  are  functions  of  the  present  state 
of  the  system;  policies  in  this  class  are  known  as  "stationary  policies" 
and  will  be  denoted  by  F . 

5.1.4  The  State  Transition  Probabilites 

A  /V 

Let  the  system  states  at  time  t  and  t  +  1  be 

X(t)  =  {n(t) ,  X(t,l) ,  X(t,2) ,  ....  X(t,Tr-l)}  (5.9) 

and 

X(t+1)  =  {n(t+l) ,  X(t+l,l),  X (t+1 , 2) , 

....  X(t>l,Tr-2),  X(t+l,Tr-l)}  (5.10) 

where  X(t,k)  is  the  input  rate  at  time  t+k.  The  input  rate  that  the  TD 

A  A  A 

decides  at  time  t  to  be  effective  at  time  t+T^,  becomes  part 

/\ 

of  the  state  vector  at  time  t+1;  therefore  the  components  of  state 
vectors  X(t)  and  X(t+1)  are  related  to  each  other  as  follows: 

X(t+l,k)  =  X(t,k+1)  1  <  k  <  Tr-2  (5.11) 

A  A  A 

The  last  component  of  state  vector  X(t+1),  X(t+l,Tr-l),  is  the  decision 
which  is  made  at  time  t.  The  relationship  between  the  input  rate 
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components  of  two  consecutive  state  vectors  is  shown  in  Fig.  5.2.  The 

A  A 

number  of  messages  in  the  buffer  at  time  t+1  is  different  from  n(t); 
this  is  the  result  of  the  possible  arrival  of  a  message,  with  probability 

A  A  A 

X(t,l),  or  the  possible  departure  of  a  message,  with  probability  y2* 

With  the  above  considerations,  the  equilibrium  transition  probabilities 

A 

12  T  -1 

under  policy  f^between  states  i  =  {n^,  X^,  X^,  ....  X^*  }  and  j  =  (n^ 


T  "1 

x}.  X2.  . . . ,  X.r  }  will  be 


as  follows: 


Let 

p^  (f)  =  Pr[X(t+l)  <—>  j  |  X(t)  <=>  i,  and  policy  f  eF  is  used]  0  <_  t 
then 


/\  1  A 

Xj(l-Y2) 


if  n.  =  n.  +  1 
3  i 


/S 1  A  A  1  A 

1-x!-y2  +  2XJy2 


if  n.  =  n. 
3  i 


if  0  <  n.  <  B 
i 


(1  -  aJ)y2 


if  n,  =  n.  -  1 
3  i 


PijCf)  - 


X1 

l 


a  | 

i  -  x! 

i 


if  n.  =  1 
3 


if  n.  =  0 
J 


(5.12-a) 


if  n.  =  0 

l 


1  -  y2  *  *Jy2 


A  1  A 

(1  -  aJ)Y2 


if  n.  =  B 
3 


if  n.  =  B -  1 
3 


if  n.  =  B 

l 


and 


Alf  a]/a|  A  A  ATr-1 

if  X.  =  X.  for  1 <  k  <  T  -  2  and  X., . .  =  X . 
31  -  -  r  f(i)  3 


p^ (f)  =  0  otherwise 


(5.12-b) 
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Mt.Tr-1) 


We  refer  to  the  matrix  of  transition  probabilities  under 
policy  f  by  P( f),  i.e.,  P(f)  =  (p^ff)}. 

5.1.5  The  Performance  Criteria  and  the  Reward  Function 

The  objective  of  our  analysis  is  to  find  an  optimal  policy 
which,  in  the  long  run,  maximizes  the  average  throughput  and  minimizes 
the  average  delay  of  the  system;  this  objective  is  reflected  in  our 
formulation  through  the  reward  criteria.  Here  again  we  are  faced  with 
incorporating  the  opposing  effects  of  throughput  and  delay  in  a  single 
objective  function.  In  Section  4.3,  faced  with  a  similar  problem,  we 
used  power  (the  ratio  of  throughput  and  delay)  as  the  measure  of  perfor¬ 
mance.  Power  is  a  good  (but  not  the  only)  measure  in  the  sense  that 
both  high  throughput  and  low  delay  characterize  the  efficiency  of  a 
system.  In  this  section,  however,  we  choose  another  performance  measure 
and  use  it  in  our  decision  problem.  Let  A  and  T  be  the  throughput  and 
the  corresponding  delay  of  the  system;  then  the  reward  for  this  system 
is 

R  =  aA  -  T  a  >  0  (5.13) 

Similar  to  power,  a  high  throughput  and  low  delay  result  in  a  large 
value  for  R,  which  reflects  a  measure  of  the  efficiency  of  the  network. 
The  coefficient  a  reflects  our  relative  preference  for  large  throughput 
with  respect  to  low  delay.  a=0  corresponds  to  a  situation  where  we 
cannot  accept  any  delay  (we  will  see  shortly  that  this  value  of  a 
results  in  zero  throughput  and  hence,  zero  delay).  The  other  extreme 
is  when  of00  ,  which  reflects  the  case  in  which  throughput  is  the  only 
measure  of  performance. 
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We  now  need  to  find  the  expected  immediate  reward  (the  reward 


to  be  expected  in  the  next  transmission)  for  each  state.  In  Appendix  C 
we  have  used  R^(f)  for  the  expected  immediate  reward  for  state  i  under 
policy  f.  As  we  will  see  shortly,  in  the  problem  at  hand  R^(f)  is 
independent  of  the  decision  made  in  state  i  (in  fact,  it  is  dependent 

A 

on  the  decision  made  Tr  time  units  earlier);  however,  for  notational 
consistency,  we  use  the  symbol  R^(f)  to  refer  to  the  expected  immediate 
reward  for  state  i  under  policy  f. 

r  A 1  a2 

Consider  state  i  =  in. ,  X. ,  X. ,  . . . ,  X.  L  A  transition  from 

1  x  1  1  l  1 

state  i  to  any  succeeding  state  (including  i)  causes  each  of  the  ni  mes¬ 
sages  in  the  buffer  to  suffer  one  unit  of  delay,  and  so  we  have 

R°  =  Reward  rate  due  to  buffer  occupancy  for  state  i 

=  -  nA  0  £  ni  £  B  (5.14) 

Once  a  message  is  admitted  to  the  buffer,  it  is  guaranteed  delivery  to 
the  receiving  user;  in  the  next  transition  a  message  may  arrive  with 
probability  X^  ,  therefore  if  n.  <  B 

R*  =  Reward  rate  due  to  throughput  for  state  i 

=  aX1  if  n.  <  B  (5.15) 

x  i 

~1 

On  the  other  hand,  when  a  message  arrives  with  rate  X^,  it  also  brings 

~1  '■'i 

in  an  amount  of  delay  equal  to  T\  (corresponding  to  X^)  which  it  has 

suffered  in  passing  through  the  network.  The  probability  that  a  message 
"1 

arrives  is  X^;  therefore 

R^  =  Reward  rate  due  to  network  delay  for  state  i 

=  -  X1.  T*  if  n.  <  B  (5.16) 

li  l 
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between  the  instant  a  decision  is  made  until  the  time  the  decision 
becomes  effective;  we  will  refer  to  this  process  as  a  Tr~cycle-delay 
Markov  decision  process.  Processes  similar  to  this  are  encountered  in 
inventory  problems,  where  there  is  a  lead-time  for  an  order  to  arrive 
[MILL  74],  [VEIN  66] . 

The  set  of  stationary  policies  in  our  decision  process  is 
actually  a  subset  of  the  set  of  all  policies  (see  Appendix  C  for  exam¬ 
ples  of  other  possible  policies).  Under  a  stationary  policy  f,  the 
average  reward  per  unit  time  is  designated  by  the  vector  3(f)  =  (g^f)), 
where  g^(f)  is  the  long  run  expected  reward  per  unit  time  given  that  the 


process  initially  starts  in  state  i.  3(f)  is  given  by 


l  n  * 

S(f)  =  lim  rfT  £  [P(f)]r/?(f) 


(5.20) 


where  [P(f)]*  is  the  t*h  power  of  P(f)  and  is  the  t-step  probability 
transition  matrix  under  the  stationary  policy  f.  If  f  is  a  policy  such 
that  P(f)  has  one  chain,  then  all  components  of  3(f)  are  eotual.  Our 
objective  is  to  find  a  (stationary)  policy  f*  which  maximizes  each 
component  of  the  expected  average  reward  per  unit  time. 


3(f*)  =  max  {3(f)} 
feF 


(5.21) 


It  can  be  shown  for  a  bounded  reward  function,  when  the  action  space  and 
the  state  space  are  finite,  that  such  a  stationary  policy  always  exists 
[BLAC  62],  [JEWE  63],  [ROSS  70],  [SOBE  74].  These  conditions  are  met 
in  the  problem  at  hand. 
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5.2  Solution  to  the  Problem:  The  Look-Ahead  Policy 

A 

The  Tr-cycle-delay  Markov  decision  process  formulated  in  the 
last  section  can,  in  principle,  be  solved  for  an  optimal  stationary  pol¬ 
icy  by  using  any  numerical  technique,  e.g.,  the  policy-iteration  method 
[HOWA  60].  Unfortunately,  for  any  realistic  set  of  parameters,  the  size 
of  the  problem  (especially  the  state  space)  becomes  too  large;  it  is 
extremely  laborious,  if  not  impossible,  to  solve  such  a  large  problem 
by  any  conventional  means.  As  an  example,  consider  a  system  in  which 

A 

the  round-trip  delay,  T  ,  is  0.2  seconds.  If  we  use  6  =  4  msec  as  our 
time  unit,  the  round-trip  delay  Tr  becomes  50  time  units.  Therefore, 
each  state  vector  has  50  components,  one  of  which  (the  first  one)  is  the 
buffer  occupancy,  and  the  remaining  49  elements  are  the  input  rates  for 
the  next  49  time  cycles.  Assume  the  buffer  size  is  10  (B=  10)  and  that 
there  are  5  different  window  sizes  to  use;  therefore,  the  size  of  the 
state  space  is 

|S|  =  11  x  549  =  1035 

Considering  the  fact  that  each  cycle  of  the  policy-iteration  method 
requires  the  solution  of  a  set  of  |S|  linear  simultaneous  equations 
(Appendix  C),  a  direct  solution  to  this  problem  is  out  of  the  question, 
hence  we  must  resort  to  a  heuristic  solution.  To  this  end  we  consider 
the  following  points: 

1.  The  major  difficulty  arises  from  a  long  round-trip  delay. 
For  a  round  trip  of  2  units  (T  =  2),  the  size  of  the  state  space  for  the 
above  example  becomes  |s|  11 x  5  =  55  and  when  Tr=  1,  it  is  only  11. 
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(T^ =  1  results  in  a  1-cycle-delay  Markov  decision  process  in  which  a 
decision  becomes  effective  in  the  next  transition;  this  is  the  case  with 
an  ordinary  Markov  decision  process  and  we  make  extensive  use  of  this 
case  in  our  heuristic  solution.) 

A 

2.  Consider  a  1-cycle-delay  decision  process  (Tr=l).  For 
this  system  the  only  information  necessary  to  determine  a  window  size 
is  the  buffer  occupancy;  in  fact,  in  this  case  the  state  of  the  system 
is  represented  simply  by  the  buffer  occupancy.  When  there  is  a  lead- 

A 

time  of  Tr >  1  for  a  decision  to  become  effective,  the  process  makes  use 
of  the  input  rate  components  of  a  state  vector  to  predict  the  future 
occupancy  of  the  buffer  at  (T  -  1)  time  units  later.  Based  on  this  pre¬ 
diction,  it  then  decides  on  a  window  size  (or  an  input  rate)  to  become 
effective  at  Tf  time  units  later.  In  fact,  if  at  time  t  we  provide  the 
decision  process  with  the  future  buffer  occupancy  at,  say  k,  time  units 

A 

later  (k<Tr),  then  from  the  decision  process  point  of  view,  the  lead- 

A 

time  for  a  decision  to  become  effective  reduces  to  T  -  k. 

r 


The  above  considerations  lead  us  to  develop  a  heuristic  solu- 

A 

tion  for  the  decision  process.  Consider  state  i  of  a  T^-cycle-delay 
Markov  decision  process, 


*1  *7 

i  =  {n.,  X1,  r.,  . 
•  i  i  i 


/vT  -1 

...  X.r  } 


(5.22) 


The  component  X^  in  the  state  vector  is  the  input  rate  to  the  buffer  at 
k  time  units  after  the  system  is  in  state  i.  By  knowing  the  future 


input  rates  to  the  buffer,  we  can  find  the  expected  occupancy  of  the 


buffer  at  T^  time  units  later  <  Tj.) ;  wc  refer  to  T^  as  the 
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look-ahead  time  and  designate  the  expected  buffer  occupancy  after 

__  A 

transitions  from  state  i  by  n.  (  +  T^) .  When  the  set  of  policies  and 
state  transition  probabilities  are  properly  defined,  the  vector 


1  = 


{W’ 


/sTo  +  1 

X  16  , 


JTo+2 
X  * 


*fr-l 

x  r  } 


(5.23) 


specifies  a  state  of  a  (T  -  T^) -cycle-delay  decision  process.  We  desig¬ 
nate  the  set  of  states  of  this  process  by  S'.  For  this  process,  a 
(stationary)  policy  f'  is  defined  as  a  function  from  the  state  space  S' 
to  the  action  space  A  defined  by  Eq.  (5.7),  i.e.,  f ':S'+A.  If  |S'|  is 
not  too  large,  we  can  find  an  optimal  stationary  policy  f"*  for  this 
process,  and  use  {'*  to  find  a  (sub)optimal  stationary  policy  f*  for 

A 

the  original  Tr-cycle-delay  decision  process,  so  that 


f*(i)  =  f  *(i  ) 


(5.24) 


where  i'  is  given  by  Eq.  (5.23)  The  expected  reward  rate  (and  the  long 
run  average  throughput)  for  policy  f*  is  very  close  to  the  optimal 
expected  reward  rate  for  the  original  process.  (The  numerical  results 
in  Section  5.5  support  this  claim.)  We  refer  to  policy  f*  as  the  opti- 

a  /\ 

mal  T^- look-ahead  policy,  where  T^  is  the  look-ahead  time.  We  summarize 
the  above  procedure  in  the  algorithm  given  below. 


Algorithm  5.1:  The  Look-Ahead  Policy 


/\ 

1.  For  the  Tr-cycle-delay  Markov  decision  process,  find  a 
look-ahead  interval  0<.T^<T  ,  such  that  the  resulting 

A  Ak 

(Tf  -  T^) -cycle-delay  Markov  decision  process  is  solvable. 
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2.  Use  any  appropriate  technique  (e.g.,  the  policy-iteration 
method.  Appendix  C)  to  find  an  optimal  policy  {'*  for  the 

A  A 

(T  -  -cycle-delay  decision  process. 


3. 


4. 


For  each  state  i  of  the  Tr-cycle-delay  decision  process 

_  A 

find  n^  ( +  T^)  and  the  corresponding  state  i  for  the 


(T  -  T.)-cycle-delay  decision  process,  where 
r  Jt 


i'  -  {n.(  +  T£),  A.*  ’,  A.*  _,  ....  A^1*1} 


VI  .V2 


The  (sub)optimal  decision  at  state  i  is  given  by  £* (i) 
where 

f*(i)  =  r*(n 


The  problem  which  remains  is  that  the  expected  future  buffer 

_  A 

occupancy,  n.  (  ♦  T.),  is  a  real  number,  whereas  for  the  solution  of  the 

1  Xf 

decision  problem  we  require  the  occupancy  to  be  an  integer.  We  can 

_  /A 

bypass  this  problem  by  rounding  n^  (  ♦  T^)  to  its  nearest  integer.  The 
above  heuristic  solution  plays  a  crucial  role  in  making  possible  the 
use  of  the  policy-iteration  method  to  solve  optimization  problems  involv¬ 
ing  long  round-trip  delays  and  a  large  number  of  possible  window  sizes 
to  choose  from. 

In  order  to  use  the  above  algorithm  for  the  (sub) optimal  selec¬ 
tion  of  window  size,  we  must  find  n^  (  +’  T^) .  This  is  the  subject  of  the 


next  section. 


5.2.1  Calculation  of  the  Expected  Buffer  Occupancy 

For  notational  simplicity  in  the  derivations  below,  instead  of 
referring  to  a  state  of  the  system  by  its  label  i  (e.g.,  Eq.  (5.22),),  we 
choose  to  specify  the  state  of  the  system  at  time  t  by  X(t), 

X(t)  =  |n(t) ,  A(t,l) ,  A(t,2) ,  ....  A(t,Tr-l)}  (5.25) 

where  n(t)  is  the  buffer  occupancy  at  time  t  and  £(t,k)  (  =  X(t  +  k)) 
is  the  input  rate  to  the  buffer  at  time  (t  +  k).  The  expected  buffer 

A  _  A 

occupancy  at  time  (t  +  k)  will  be  designated  by  n(t  +  k);  therefore,  if 
X(t)  <=>  i  (see  Eqs.  (5.5)  and  (5.6)),  then 

n(t  +  k)  =  n.(  +  k)  (5.26) 


Consider  the  state  of  the  system  at  time  t  given  in  Eq.  (5.25). 

A  A  A 

In  order  to  find  the  expected  buffer  occupancy  at  time  t  +  (where 
is  the  look-ahead  time),  we  need  to  know  the  equilibrium  probability 

A  /V 

distribution  of  the  number  of  messages  in  the  buffer  at  times  t+1,  t+2, 

A  A 

....  t+T^.  This  can  easily  be  done  by  considering  the  buffer  system  as 
a  (discrete  time)  birth-death  process  [KLEI  75]  in  which  the  death  rate 
is  y2>  the  birth  rate  at  time  t  +  k  is  A(t  +  k),  and  at  most  one  birth 
and/or  one  death  can  occur  at  each  time  unit.  Let 


Pj  (t  +  k)  =  Pr[at  time  t+k  there  are  j  messages  in  the  buffer] 

°lk<^r  (5.27) 

Because  there  are  n(t)  messages  in  the  buffer  at  time  t,  then  trivially 


j  0  j  t  n(t) 

(  1  j  =  n(t) 


(5.28) 
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Let  l^k^T^  and  t'  =  t  +  k-1,  then 


[1  -  X(t.k)]PQCt-)  ♦  [1  -  A(t,k)]Y2P1(t')  if  j  =  0 

X(t,k)PQ(t')  *{[l-X(t  +  k)](l-  y2)  +  Mt  +  kjY^PjftO 


P.  (t  ♦  k)  = 

J 


♦  [l-X(t,k)]Y2P2(0 


if  j  =  1 


A(t.k)  (1  -  i  [1  -  Mt  ♦  k)  ]  (1  -  y2)  ♦  X(t  ♦  k)Y2}Pj(tO 

+  [1  -  A(t  +  k)]Y2Pj+1(t')  if  1  <  j  <  B 


A  A 


X(t,k)(l-  Y2)PB.j(t  )  +  [1  -  Y2+  XCt*  k)Y2]PB(t')  if  j  =  B 

I 

(5.29) 

Using  these  occupancy  probabilities,  the  average  number  of 
messages  in  the  buffer  can  be  calculated  as  follows: 

B 

n(t  +  k)  «  £  P  (t*k)j  0  1  k  <  T 

j  =  l  3 

and  we  have 

n(t)  =  n(t)  k  =  0  (5.30) 


and 

n(t  +  k)  =  n(t  +  k-  1)  +  A(t,k)[l-  (1  -  Y2)Pfi(t  ♦  k  -  1)] 

-  y2[l-P0(t+k- 1)]  0<k<TJl 

(5.31) 

The  first  bracket  on  the  right-hand  side  of  Eq.  (5.31)  is  the 
probability  that  at  time  (t  +  k)  a  message  can  be  accepted  into  the 

A  A 

buffer.  At  this  time  a  message  arrives  with  probability  A(t,k);  there¬ 
fore,  the  first  term  is  the  increase  in  the  average  buffer  occupancy  at 
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time  (t  +  k)  with  respect  to  the  average  occupancy  at  time  (t  +  k -  1). 

A 

The  second  bracket  is  the  probability  that  at  time  (t  +  k  -  1)  the  buffer 
is  not  empty.  If  that  is  the  case,  a  message  may  leave  the  buffer  with 

A 

probability 

We  can  use  Eqs.  (5.28)  through  (5.31)  iteratively  to  find  the 
expected  buffer  occupancy  at  time  t +  T^.  Once  n(t +  T^)  is  calculated, 
it  is  rounded  to  its  closest  integer  and  is  used  in  Algorithm  5.1. 

5.3  Implementation  of  the  Look-Ahead  Policy  -  The  Decision  Table 

We  have  so  far  answered  two  of  the  three  questions  we  addressed 
in  the  opening  remarks  of  this  chapter.  The  state  of  the  system  (net¬ 
work  and  the  destination  buffer)  was  defined  as  a  vector,  the  components 
of  which  are  the  buffer  occupancy  and  the  future  input  rates  to  the 
destination  buffer.  After  defining  a  reward  function,  we  gave  a  heur¬ 
istic  solution  for  the  optimal  policy  to  select  the  best  window  size  for 
each  state.  In  this  section  we  address  the  third  question  and  describe 
the  structure  of  the  decision  table,  how  it  is  set  up,  and  how  it  is 
used  by  the  TD. 

In  Section  5.2  we  discovered  how  a  (sub)optimal  stationary  pol- 

A  > 

icy  for  the  Tr-cycle-delay  Markov  decision  process  can  be  found  heur- 

A  /V 

istically  by  using  the  optimal  policy  for  the  corresponding  (T  -  T^)- 
cycle-delay  decision  process.  Once  an  optimal  policy  for  this  smaller 
size  decision  process  is  found,  we  can  set  up  a  decision  table  which 
has  one  entry  for  each  state  of  the  (T  -  T^) -cycle-delay  decision 
process.  For  each  state,  the  corresponding  (optimal)  window  size  is 
listed  in  the  table;  therefore 
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(5.32) 


Size  of  the  decision  table  = 


(B  +  1)  K 


(Tr-Trl) 


where  B  is  the  buffer  size,  K  is  the  number  of  different  window  sizes, 

A  A 

is  the  round-trip  delay,  and  is  the  look-ahead  time.  The  TD  always 
stores  the  current  state  vector  of  the  system;  at  time  t  this  vector  is 
as  follows: 


X(t)  =  (n(t),  X(t,l),  X(t,2),  ...,  X(t,T.-l)} 

Each  time  the  TD  wants  to  decide  on  a  window  size,  it  calculates  the 
expected  occupancy  of  the  buffer  at  time  t  +  f^,  n(t  +  T^),  and  sets  up 
the  following  vector: 

X'(t)  =  {n(t  +  f£),  A(t,T£+l),  ....  X(t,Tr-l)}  (5.33) 

(In  fact,  the  expected  buffer  occupancy  should  be  rounded  to  its  nearest 
integer.)  X'(t)  is  a  state  vector  for  the  (T  -T.) -cycle-delay  decision 

I*  J6 

process,  and  the  window  size  for  X^(t)  in  the  decision  table  is  the  one 
that  the  TD  should  use.  In  Figure  5.3  we  have  shown  the  decision  table 
when  T„  =  T  -  1,  and  the  destination  buffer  size  is  6.  In  this  case 
the  decision  table  specifies  an  optimal  policy  for  a  1-cycle-delay 
decision  process  in  which  a  decision  becomes  effective  on  the  next  tran¬ 
sition  (which  is  the  case  for  an  ordinary  Markov  decision  process); 
therefore,  the  state  of  the  process  is  simply  the  buffer  occupancy.  A 
decision  is  an  optimal  window  size  for  a  state.  Note  that  when  the  buf¬ 
fer  occupancy  is  5  or  6,  the  optimal  window  size  is  0,  i.e.,  for  the 
particular  value  of  a  which  is  used  for  this  table,  the  system  is  closed 
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to  further  input  when  the  occupancy  is  5  or  6.  (See  Table  5.1  for  other 
examples  of  a  decision  table.) 


State 

Decision 

(buffer  occupancy) 

(window  size) 

0 

4 

1 

3 

2 

2 

3 

1 

4 

1 

5 

0 

6 

0 

Figure  5.3  Example  of  a  Decision  Table 

For  Fig.  5.3,  X'(t)  given  by  Eq.  (5.33)  reduces  to  a  single 
number,  i.e., 

X'(t)  =  ntt  +  fy  =  n(t  +  Tr-  1) 

Ak 

The  choice  of  the  look-ahead  time,  T^,  should  be  based  on  two 
considerations:  optimality  of  the  decisions  made  by  the  TD  and  feasi¬ 
bility  of  solving  the  reduced  decision  process.  For  a  small  T^,  the 

A 

decisions  made  by  the  TD  are  closer  to  the  optimal  (note  that  for  T^ =  0, 
the  TD's  decisions  are  optimal);  however,  as  we  pointed  out  earlier,  the 

A  A 

resulting  (T  -  Tc) -cycle-delay  decision  process  is  too  large  to  solve, 
r  Jt 

and  also  the  size  of  the  decision  table  becomes  too  large,  see  Eq.  (5.32). 

A 

On  the  other  hand,  when  T^  is  large  the  result  of  the  decision  made  by 

A 

the  TD  may  be  far  from  optimal;  nevertheless,  a  large  T^  also  results  in 
a  small  decision  process  to  solve  and  a  small  decision  table.  The 
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largest  value  of  is  T  -  1,  for  which  the  state  of  the  reduced  decision 

A  A  A 

process  is  simply  the  buffer  occupancy.  For  T.  =  T  -2,  X'(t)  has  two 

jc  r 

components  which  corresponds  to  the  state  of  a  2-cycle-delay  decision 

process.  Although  this  system  is  not  too  large  to  solve  (the  size  of 

state  space  for  B=  10  and  K=  5  is  55),  a  problem  which  arises  is  that 

the  resulting  state  transition  probability  matrix  (Eq.  (5.12))  may  have 

more  than  one  irreducible  chain;  therefore,  the  solution  to  the  decision 

process  becomes  very  complex  and  time  consuming.  (The  policy  iteration 

algorithm  described  in  Appendix  C  is  for  single  chain  Markov  processes; 

for  multi-chain  processes  some  modifications  to  the  algorithm  should  be 

made,  see  [HOWA  60,  71].)  For  values  of  T„  less  than  T  -  2,  not  only 

*•  r 

does  that  state  space  become  large,  but  also  we  face  the  problem  of 

A  A 

multi-chain  systems.  For  these  reasons,  -  1  is  a  convenient 

choice,  and  we  have  used  this  value  of  in  our  numerical  examples.  To 

A  A  A 

support  this  choice  of  T^,  we  also  used  T^  =  Tf  -  2  and  found  that  the 
performance  of  the  network  (in  terms  of  throughput  and  delay)  was  almost 

A  A 

identical  to  the  case  when  T0  =  T  -  1.  In  the  next  section,  we  elab- 

Z  r 

orate  on  Markov  decision  processes  with  a  lead  time  equal  to  one  (i.e., 
a  decision  becomes  effective  on  the  next  transition) . 


/ 
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Tr-cycle-delay  processes  we  presented  in  Section  5.1  also  applies  to  a 
discrete  time  Markov  decision  process;  however,  because  we  have  used 

a  A 

T-  =  T  -  1  in  our  numerical  examples  (Section  5.5),  we  find  it  useful 
x  r 

to  elaborate  briefly  on  this  1-cycle-delay  process. 


The  state  of  the  process  is  the  number  of  messages  in  the 


buffer 


S  =  (0,  1,  ....  B) 


(5.34) 


where  B  is  the  buffer  size.  The  action  space  A,  as  defined  in  Section 
5.1.2,  is  the  set  of  window  sizes  W  (or  equivalently  input  rates  A)  that 
can  be  used  (Eq.  5.7).  Corresponding  to  each  window  size,  say  w^, 

/V  A 

there  is  a  network  delay  T^  and  a  timeout  period  (we  will  shortly 
describe  how  these  quantities  are  calculated).  A  (stationary)  policy  f 
is  a  function  from  the  state  space  S  to  the  action  space  A  (i.e.,  f:5-*v4) 

A 

such  that  if  f(i) =  k,  i e 5,  we  use  w^  (or  X^)  when  the  system  is  in 
state  i.  The  transition  probabilites  are  basically  similar  to  Eq.(5.12) 
We  use  p„(f)  to  indicate  the  transition  probability  from  state  i  to 
state  j  when  (stationary)  policy  f  is  used.  If  f(i)  =  k,  then 


VJ-V 


A  Ak  A  /\ 

1  "  *k  "  Y2  +  2XkY2 


Pij(f)  - 


U  -  Xk)y2 


(1  -  v 


A  A  A 

1  •  Y2  +  XkY2 

A 

U  -  Xk)Y2 


3  -  i*  1 


3  =  i 


3  ■  i-  1 


j  =  1 

j  =  0 


j  =  B 
j  =  B-  1 


0  <  i  <  B 


i  =  0 


i  =  B 


(5.35) 


We  denote  the  matrix  of  transition  probabilites  under  policy  f 
by  P(f) .  As  in  Section  5.1.5,  we  can  find  the  expected  immediate  reward 


/ 


rate  at  state  i  under  policy  f. 

!a  a  a 

■ (i  *  w 

“V2  -  (i*ikVl,Sk(1-,21,l] 


if  i  <  B  and  f  (i)  =  k 
if  i  =  B  and  f (i)  =  k 


We  refer  to  {R^f)}  by  P(f).  Our  objective  is  to  find  a  stationary 
policy  f*  which  maximizes  the  components  of  the  (B  +  l)-vector  of  the 
expected  reward  rates 

£(f*)  =  max  £(f) 
fcF 

where  g(f)  =  (g^(f))  is  the  vector  of  long  run  expected  rewards  per  unit 
time  when  policy  f  is  used  (Eq.  (5.20)).  It  can  be  shown  that  such  an 
optimal  policy  always  exists;  furthermore,  when  P(f)  has  one  irreducible 
chain,  then  all  components  of  g(f)  are  equal  and  we  can  use  the  algo¬ 
rithm  given  in  Appendix  C  to  find  this  policy. 

When  the  time  increment  6  is  very  small,  decisions  become  effec¬ 
tive  very  fast;  in  the  limit  as  6  +  0,  the  above  discrete  time  Markov 
process  becomes  a  continuous  time  process.  In  the  continuous  time  mode 
the  rate  of  transitions  (rather  than  the  probability  of  a  transition) 
out  of  a  state  is  a  function  of  the  action  taken  in  that  state.  Because 
in  this  process  decisions  become  effective  instantaneously,  the  optimal 
reward  rate  for  the  continuous  time  process  is  the  best  performance  we 
can  expect  from  the  system,  and  the  throughput  for  the  optimal  policy 
f*  is  the  highest  that  can  be  expected.  Therefore,  we  will  compare  the 
performance  of  our  look-ahead  policy  with  the  performance  of  this 
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continuous  time  model. 


Consider  a  continuous  time  Markov  decision  process  in  which 
actions  are  taken  according  to  a  stationary  policy  f;  in  this  case  the 
policy  f  specifies  the  input  rate  to  the  destination  buffer;  therefore 
the  input  rates  to  the  buffer  are  functions  of  the  states  as  shown  in 
Fig.  5.4.  For  this  system  we  can  find  the  distribution  of  the  equilib¬ 
rium  occupancy  probabilities  and,  in  particular,  the  average  input  rate 
to  the  system 


B-l 


wf>  *  S0 


(5. 36) 


Because  each  message  that  is  accepted  to  the  buffer  is  guaranteed  to  be 
delivered  to  the  receiving  user,  X.^(f)  is  also  the  long  run  throughput 
of  the  system  under  policy  f.  If  X^gj  (the  decided  input  rate  when  the 
buffer  is  full)  is  not  zero,  then  P^(f)  is  the  probability  of  loss  when 
window  sizes  (or  input  rates)  are  selected  under  policy  f 


P£  =  PB(f)  (5.37) 

As  we  see  shortly,  in  order  to  find  the  set  of  input  rates  A,  we  must 
know  this  probability. 

5.4.1  A  Procedure  to  Find  the  Set  of  Input  Rates 

So  far  we  have  assumed  that  for  a  set  of  window  sizes  W,  we 
can  use  the  results  of  our  static  flow  control  (Chapter  4)  to  find  the 
optimal  input  rates  (or  throughputs)  of  the  network  A=  (Xj,  X^,  ...,  X^}. 
In  Chapter  4  we  found  that  if  the  window  size  w  ,  channel  capacity 
and  the  loss  probability  P^  of  a  network  are  known,  then  an  optimal 
timeout  (t*)  can  be  found  such  that  the  throughput  of  the  network 
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becomes  maximal  (Eqs.  (4.14)  and  (4.16)).  The  problem  which  we  face 


here  is  that  the  loss  probability,  P^,  is  itself  a  function  of  the 
(optimal)  policy  that  we  use  to  select  window  sizes  (Eq.  5.37)).  On 
the  other  hand,  the  optimal  policy  is  determined  by  using  the  set  of 
input  rates  A;  therefore,  evaluation  of  input  rates  {X^}  depends  on  the 
loss  probability  P^,  which  is  itself  a  function  of  the  input  rates.  To 
remove  this  recurrence,  we  can  use  a  linear  iteration  method  given  in 
Algorithm  5.2. 

Algorithm  5.2 

1.  Initialize  P£  (e.g.,  P^O). 

2.  Find  the  set  of  input  rates  for  the  set  of  window  sizes  W. 

3.  Use  the  set  of  input  rates,  A,  in  the  policy- iteration 

method  to  find  an  optimal  policy  f*. 

4.  Find  the  new  loss  probability,  P'  =  Pfi(f*). 

5.  If  |P^  -  P'|  >  e  set  P^  =  P'  and  go  to  step  2. 

6.  Stop. 

We  start  the  procedure  by  initializing  the  loss  probability  P^ 
(an  initial  value  of  0  is  a  good  choice).  With  this  probability  of  loss 
we  can  find  the  set  of  optimal  input  rates  A  corresponding  to  the  set  of 
window  sizes  W.  Having  found  A  (and  the  corresponding  network  delays 
and  timeout  periods),  we  can  use  the  policy-iteration  method  to  find  an 
optimal  policy  f*  which  maximizes  the  reward  rate,  step  3.  Corresponding 
to  this  policy,  there  is  a  loss  probability  P',  Eq.  (5.37).  If  P'  and 
P^  are  not  close  enough  to  each  other,  we  repeat  steps  2  to  4  using  the 
new  loss  probability  instead  of  P^.  When  the  difference  between  P' 


and  is  small,  policy  f*  found  in  Step  3  is  the  "optimal"  policy 
(within  the  tolerance  e)  that  we  should  use  in  our  decision  table.  In 
all  of  our  numerical  examples  Algorithm  5.2  converged  in  less  than  six 
iterations;  however,  this  algorithm  is  not  always  guaranteed  to  termi¬ 
nate  -  in  a  narrow  sense  it  should  be  referred  to  as  a  procedure. 

5.5  Numerical  Results 

In  this  section  we  present  numerical  examples  for  dynamic  flow 
control,  study  the  optimality  of  the  look-ahead  policy,  and  investigate 
the  effect  of  the  parameter  a  on  the  throughput-delay  performance  of  a 
network.  As  in  Chapter  4,  throughout  our  presentation  we  consider  the 
normalized  value  of  certain  parameters;  the  normalization  constant  is 
Xj  =  1/CyCj),  the  average  transmission  time  of  a  message  on  a  network 
channel . 

We  begin  by  studying  the  throughput-delay  performance  of  a 
continuous  time  model  in  which  the  effect  of  a  decision  is  instantaneous. 
This  means,  based  on  the  occupancy  of  the  buffer,  when  a  new  window  size 
(or  input  rate)  is  decided  upon,  it  becomes  effective  instantaneously  in 
the  next  state  transition.  Therefore,  in  this  hypothetical  system  there 
is  no  time  gap  from  the  moment  that  a  decision  is  made  until  it  becomes 
effective.  Hence,  performance  of  the  continuous  time  model  is  the  best 
that  can  be  achieved. 

As  we  pointed  out  earlier,  the  value  of  a  reflects  our  relative 
preference  for  high  throughput  over  low  delay.  Table  5.1  shows  the 
effect  of  parameter  a  on  the  throughput -del ay  performance  of  the  system. 
In  this  table  the  destination  buffer  size  (B)  is  10,  the  channel 
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B  =  10 


Cl  =  C2  =  50  kbps 
W  =  {0,  1,  2,  ....  20}  1/U  =  1000  bits 


a  =  0.4 


a  =  0.6 


STATE 

PROB 

WINDOW 

0 

0.310 

6 

1 

0.232 

5 

i  2 

0.176 

5 

3 

0.128 

4 

4 

0.078 

3 

5 

0.047 

3 

6 

0.028 

2  j 

7 

0.014 

1 

8 

0.001 

0 

9 

0.00 

0 

10 

0.00 

0 

i  Average  window: 

4.77 

{  Average  buffer  occupancy:  1.78  I 

I  Average  throughput : 

34.51 

j  Average  buffer  delay:  0.05  | 

j  Average  network  delay:  0.07  f 

|  Average  total  delay 

0.12  | 

STATE 

PROB 

WINDOW 

0 

0.248 

8 

1 

0.199 

8 

2 

0.159 

7 

3 

0.123 

6 

4 

0.092 

5 

5 

0.066 

5 

6 

0.047 

4 

7 

0.031 

3 

1  8 

0.019 

2 

9 

0.009 

2  ! 

10 

0.004 

0 

|  Average  window : 

6.66 

Average  buffer  occupancy:  2.38  I 

j  Average  throughput : 

37.59 

Average  buffer  delay  0.06  j 

Average  network  delay:.  0.09  | 

j  Average  total  delay 

°.15  j 

Table  5.1.  Example  of  optimal  window  selection 
for  a  continuous  time  model. 
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capacities  of  the  network  and  destination  node  (C^  and  C^)  are  both 
50  kbps,  the  average  message  length  is  1000  bits,  and  the  set  of  allow¬ 
able  window  sizes  is  V  -  {0,  1,  2,  ...,  20}.  This  table  shows  the  opti¬ 
mal  policies  and  their  corresponding  statistics  for  a  =  0.4  and  a  =  0.6. 
It  is  seen  that  for  a  =  0.6,  larger  window  sizes  are  used  (and  a  higher 
throughput  results);  for  example,  when  the  buffer  is  empty  (state  0) 
for  a  =  0.4,  w=6  is  the  optimal  decision,  whereas  when  a  =  0.6,  w  =  8 
becomes  the  best  decision.  Although  the  delay  (and  throughput)  due  to 
w=8  is  larger  than  w  =  6,  a  window  size  of  8  becomes  the  optimal  deci¬ 
sion  because  a  =  0.6  puts  more  weight  on  high  throughput  than  a  =  0.4. 

It  is  seen  that  not  all  of  the  allowable  window  sizes  are  used  in  either 
of  the  above  cases;  therefore,  not  only  can  the  parameter  a  be  used  to 
control  the  throughput,  but  the  maximum  window  size  can  be  controlled 
also  by  a  proper  selection  of  a.  Considering  the  fact  that  the  maximum 
window  size  which  is  used  reflects  the  number  of  buffers  needed  at  the 
source  node,  the  importance  of  this  parameter  becomes  clear.  Further¬ 
more,  the  number  of  buffers  required  at  the  destination  node  is  also 
determined  by  a  (for  example,  for  a  =  0.4  a  buffer  size  of  B  =  8  is 
sufficient);  therefore,  this  parameter  can  also  be  used  to  control  the 
utilization  of  the  destination  node  buffer. 

Figure  5.5  shows  the  throughput-delay  (end-to-end  or  total 
delay)  performance  of  the  continuous  time  model  for  different  buffer 
sizes  for  the  destination  node.  The  set  of  allowable  window  sizes  and 
other  parameters  of  the  system  are  the  same  as  in  Table  5.1.  For  a 
fixed  B,  the  average  throughput  and  delay  increase  as  the  parameter  a  * 
increases.  For  a  value  of  a  an  optimal  decision  is  determined  such  that 
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NORMALIZED  END-TO-END  DELAY 


Figure  5.6  -  Throughput-May  Parformanca  of  Continuout  Tima  Modal 
For  Different  Daatination  Buffer  Sizes. 


the  expected  (long  run)  reward  rate  becomes  maximal.  From  Eq.  (5.13)  we 
can  relate  the  expected  reward  rate  to  the  (long  run)  average  input  rate 
(or  throughput)  and  the  average  delay  of  the  system, 

R  =  aX  -  T  (5.38) 

Eq.  (5.38)  indicates  that  under  an  optimal  policy,  for  each  X  the  cor¬ 
responding  T  is  the  lowest  that  can  be  achieved  (equivalently  for  each 
T,  X  is  the  maximal).  If  there  were  another  policy  that  could  generate 
lower  delay  for  the  same  X,  then  R  would  not  be  maximal.  Therefore,  the 
throughput -delay  curve  for  a  fixed  B  in  this  figure  defines  a  tight 
lower  bound  on  delay,  and  so  it  may  be  veiwed  as  the  optimal  network 
performance  for  the  dynamic  flow  control. 

When  there  is  a  lead-time  for  decisions  to  become  effective, 

the  throughput-delay,  performance  of  a  network,  when  window  sizes  are 

\ 

decided  upon  by  a  look-ahead  policy,  will  not  be  as  good  as  the  instan¬ 
taneous  feedback  case.  Fig.  5.6  shows  the  performance  of  the  network 
under  the  look-ahead  policy.  In  this  figure,  the  time  interval  6  is 
4  msec  (i.e.,  the  state  of  the  system  is  observed  every  4  msec),  and  the 

A 

destination  node  has  10  buffers  (B =  10).  The  round-trip  delay  (T  )  is 
assumed  to  be  30  time  units  for  the  entire  range  of  throughput,  and  the 

A  A 

look-ahead  time  is  29  (i.e.,  T^  =  Tf  -  1).  The  other  parameters  of  the 
system  are  the  same  as  in  Fig.  5.5.  The  solid  curve  in  Fig.  5.6  cor¬ 
responds  to  a  1-cycle-delay  (discrete  time)  decision  process,  which  is 
the  counterpart  of  the  continuous  time  model;  therefore,  this  curve 
represents  the  best  performance  that  can  be  achieved  in  the  discrete  time 
model  from  a  look-ahead  policy.  In  order  to  investigate  its  performance. 
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Figure  5.8  -  Throughput-Delay  Performance  of  Look-Ahead  Policy. 
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we  tested  the  look-ahead  policy  by  simple  simulation  experiments. 

In  the  simulation  experiments  we  used  Tf  =  30  and  T^  =  T  -  1  =  29. 
The  other  parameters  (C^,  C^,  ....  etc.)  were  the  same  as  in  Fig.  5.6. 

The  decision  table  was  set  up  by  finding  an  optimal  policy  for  the  cor¬ 
responding  1-cycle-delay  decision  process  (Section  5.3).  Initial  occu¬ 
pancy  of  the  buffer,  n(0),  and  the  input  rates  for  the  first  29  time 

A  A  A. 

units  (A(l),  A(2),  ...,  A(29))  were  also  initialized.  (When  the  simula¬ 
tion  run  is  long  enough,  the  initial  values  do  not  have  any  effect  on 
the  final  statistics.)  At  each  time  unit,  say  t,  an  input  to  the  buffer 
and  an  output  from  the  buffer  (with  probabilities  A(t)  and  y^,  respec¬ 
tively)  were  generated  by  the  Monte  Carlo  technique,  and  the  buffer 
occupancy  was  updated.  After  collecting  the  statistics  (the  input  rate 

a  /\ 

and  delay),  an  input  rate  for  time  t +  T^  was  decided  by  using  the  deci¬ 
sion  table,  and  the  simulation  time  was  advanced.  (Actually,  when  the 
decision  cycle  was  greater  than  1,  the  decisions  were  made  periodically; 
see  below.)  The  simulations  were  run  long  enough  (20,000  time  units) 
to  remove  the  effects  of  the  initial  values. 

The  points  indicated  by  "T"  in  Figure  5.6  indicate  the  perfor¬ 
mance  of  the  look-ahead  policy  when  decisions  are  made  at  each  time  unit 
(decision  cycle  =1).  It  is  seen  that  the  throughput-delay  performance 
in  this  case  is  very  close  to  the  optimal.  Considering  the  fact  that 

A  /V 

for  -  1  =  29,  the  performance  of  the  look-ahead  policy  is  the 

worst,  the  match  between  the  optimal  performance  and  the  performance 
due  to  the  look-ahead  policy  Is  very  encouraging. 

We  pointed  out  earlier  that  control  packets  to  carry  decisions 
to  the  source  node  are  piggybacked  on  messages  going  from  the  destination 


node  to  the  source  node,  and  if  there  is  no  such  message,  control  packets 
are  transmitted  by  themselves.  Therefore,  signalling  of  control  packets 
generates  overhead  traffic.  For  this  reason  it  is  not  a  good  policy  to 
decide  on  a  new  window  size  at  each  time  unit  and  it  is  better  to  make 
decisions  periodically.  In  Fig.  5.6  we  have  shown  the  performance  of 
the  look-ahead  policy  when  decisions  are  made  every  5  time  units  (">"), 
or  every  10  time  units  ("•").  (When  the  decision  cycle  is,  say  5,  the 

A  A 

decision  made  at  time  t  is  used  as  the  decision  for  times  t  +  1  through 
t  +  4;  at  time  t  +  5  a  new  decision  is  made.)  For  decision  cycles  larger 
than  10  the  throughput-delay  curves  differed  from  the  optimal  (the  solid 
curve)  by  too  much.  Therefore,  as  long  as  the  decision  cycle  is  below 
a  certain  limit,  the  throughput-delay  performance  of  the  periodic  look¬ 
ahead  policy  is  not  sensitive  to  the  exact  value  of  the  decision  cycle. 
This  property  is  important,  as  a  periodic  decision  reduces  the  overhead 
due  to  signalling  of  control  packets. 

A 

So  far  we  have  assumed  that  the  round-trip  delay,  T  ,  is  fixed 
for  the  entire  range  of  throughput;  in  reality  T ^  is  larger  for  higher 
throughputs.  To  investigate  the  sensitivity  of  the  throughput-delay 
performance  to  round-trip  delay,  we  used  the  simulation  program  described 
earlier,  but  changed  the  round-trip  delay  as  the  throughput  increased 

(the  look-ahead  time  was  also  changed  such  that  T  =  T  -  1).  The  points 

Jt  r 

indicated  by  in  Fig.  5.7  correspond  to  the  performance  of  the  system 

when  Tr  varies.  The  solid  curve  in  this  figure  is  the  throughput -delay 
curve  for  the  1-cycle-delay  decision  process  for  which  the  round-trip 

A 

delay  is  fixed  at  Tf =  30.  This  figure  indicates  that  the  performance  of 

A 

the  look-ahead  policy  is  not  sensitive  to  a  variation  in  T  ,  and  if  a 
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NORMALIZED  THROUGHPUT 


Figure  5.7  -  Throughput-Delay  Performance  of  Look-Ahead  Policy  for  Varying  Round-Trip  Delay. 


H» 


proper  average  round-trip  delay  is  used,  the  performance  is  close  to 
the  optimal. 

In  order  to  investigate  the  sensitivity  of  the  throughput -delay 

A 

performance  on  the  look-ahead  time  (T^) ,  we  simulated  a  network  with 

/v  /v 

parameters  shown  in  Fig.  5.6,  but  we  let  T^  =  Tf  -  2  =  28.  The  perfor- 

/\ 

mance  in  this  case  was  almost  identical  to  the  case  when  =  29.  To 

A 

use  a  look-ahead  time  smaller  than  T  -  2,  we  must  find  an  optimal  policy 
for  the  corresponding  n-cycle  delay  Markov  decision  process,  where  n>2. 
As  pointed  out  earlier,  the  solution  is  fairly  complicated  and  time  con¬ 
suming  because  the  corresponding  Markov  processes  may  have  more  than  one 
irreducible  chain.  Considering  the  fact  that  a  maximum  look-ahead  time 

A  A 

(i.e.,  T.  =  T  -  1)  results  in  a  performance  very  close  to  the  optimal 
r 

/\ 

(Fig.  5.6),  T  -  1  is  a  convenient  and  good  choice  for  the  look-ahead 
time  (fp. 

5.6  Conclusion 

In  this  chapter  we  developed  and  analyzed  a  dynamic  decision 
policy  to  select  the  window  size  to  control  the  flow  in  store-and- forward 
computer  networks.  The  dynamic  flow  control  developed  here  is  essen¬ 
tially  an  extension  of  the  static  flow  control  studied  in  the  previous 
chapter . 

Based  on  the  Markov  decision  process,  the  decision  problem  was 
formulated,  and  a  heuristic  solution  to  an  optimal  policy  was  presented 
as  an  alternate  to  the  laborious  exact  solution.  The  formulation  and 
the  heuristic  solution  is  general  in  the  sense  that  it  can  be  applied  to 
any  decision  process  in  which  there  is  a  lead-time  for  a  decision  to 
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become  effective,  a  common  situation  in  inventory  systems.  We  introduced 
a  parameter  a  which  reflects  our  preference  for  throughput  over  delay. 
This  parameter  can  be  considered  as  a  further  control  tool  to  limit  the 
throughput,  the  delay,  the  average  and/or  the  maximum  window  size,  and 
also  to  limit  the  destination  buffer  utilization  (Table  5.1). 

The  numerical  examples  of  Section  5.5  indicate  that  the  perfor¬ 
mance  of  a  network  when  window  sizes  are  selected  according  to  a  sub- 
optimal  policy  determined  by  the  heuristic  solution  is  very  close  to  the 
performance  of  an  optimal  policy.  Fig.  5.6.  This  figure  further  indi¬ 
cates  that  the  performance  of  a  periodic  decision  is  very  close  to  the 
optimal  so  long  as  the  decision  period  is  chosen  properly.  This  property 
is  of  significant  importance  in  practical  application,  as  a  periodic 
decision  reduces  the  overhead  due  to  the  signalling  of  control  packets. 
Lastly,  comparison  of  Fig.  5.6  and  Fig.  4.15  shows  that  the  throughput- 
delay  of  a  network  is  better  than  the  performance  under  a  static  control 
when  the  window  sizes  are  chosen  dynamically.  Therefore,  dynamic  flow 
control  can  further  enhance  the  traffic  handling  capability  of  computer 
networks . 
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CHAPTER  6 


INTERCONNECTION  OF  NETWORKS 

The  interconnection  of  computer  networks  has  recently  become 
a  subject  of  interest.  The  idea  is  to  make  the  computing  power  and 
resources  of  a  network  available  to  users  and  computers  attached  to 
other  networks  [CERF  74],  [SUNS  75],  In  this  chapter  we  study  some 
issues  related  to  the  interconnection  of  computer  networks,  otherwise 
known,  as  internetting.  As  in  previous  chapters,  we  aim  to  develop  ana¬ 
lytic  tools  for  the  performance  study  and  design  of  computer  networks. 

In  many  cases,  the  models  which  we  use  are  rather  simplistic;  neverthe¬ 
less,  they  help  us  understand  the  implications  of  real  system  behavior. 

We  begin  by  establishing  a  network  algebra  as  an  aid  in  the  systematic 
study  of  computer  network  interconnection;  we  then  look  at  some  design 
pioblems  for  allocation  of  resources  among  a  number  of  parallel-connected 
networks.  Finally,  we  discuss  buffer  allocation  strategies  in  nodes 
where  a  number  of  networks  merge  together. 

6.1  A  Network  Algebra 

This  section  is  concerned  with  the  performance  analysis  of 
systems  composed  of  series  or  parallel  interconnections  of  a  number  of 
computer  nets.  In  particular,  in  analogy  with  the  basic  laws  of  elec¬ 
trical  circuits  (Kirchoff's  Laws),  we  develop  a  network  algebra  for  the 
systematic  study  of  computer  networks,  and  characterize  the  equivalent 
network  (defined  below)  for  such  interconnected  nets. 
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For  the  purpose  of  the  present  study  we  need  closed  form  expres 
sions  for  the  maximum  throughput  and  delay  of  a  network  in  terms  of  its 
system  parameters  (window  size,  channel  capacities,  etc.)-  Such  expres¬ 
sions  could  not  be  found  for  the  model  developed  in  Section  4.4,  hence 
we  use  the  (less  realistic)  results  of  our  study  in  Section  4.3.  Accord 
ing  to  that  analysis  we  have  the  following  expressions  for  the  maximum 
network  throughput  and  delay: 


(4.5) 


(4.7) 


where  w  is  the  window  size,  n^  is  the  path  length  (number  of  hops)  and 
y(=yC)  is  the  maximum  transmission  rate  of  the  channels.  (Note  that 
when  w  =  0  or  y  =  0  there  is  no  traffic,  hence  the  delay  is  defined  to  be 
zero.)  The  above  equations  can  be  written  in  the  following  form 


/  £*  +  2 

I  — 2^ —  nf,  A*  >  0  and  y  >  0 
|  0  £*  =  0  or  y  =  0 


(6.2) 


where  9*  =  w/n, 
n 

The  parameter  £*,  as  we  will  see  shortly,  plays  an  important 
role  in  our  study  and  we  find  it  useful  to  reflect  on  its  physical  inter 
pretations.  The  window  size  w,  as  we  have  used  in  our  analysis,  is  the 
average  number  of  messages  in  the  network  (more  precisely,  the  average 
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number  of  messages  on  the  network  path  under  our  study).  There  are  n^ 
hops  (or  nodes)  on  the  path  (hence  there  are  2n^  hops  on  a  round  trip) 
therefore,  when  the  network  carries  a  traffic  determined  by  Eq.  (6.1), 


w/n,  will  be  the  average  number  of  messages  in  a  node,  of  which 


£*/2  are  being  transmitted  to  the  destination  and  the  other  half  are 
carrying  acknowledgements  to  the  source  node  (recall  that  we  have  assumed 
acknowledgements  are  piggybacked  on  messages  sent  from  the  destination 
node  to  the  source  node).  In  queueing  theory  the  average  number  of  cus¬ 
tomers  in  a  system  may  be  considered  to  be  the  system  load  [AGNE  74]; 
therefore,  £.*  =  w/n.  is  the  load  (or  the  load  factor)  of  a  node  (and 


<.*/2  is  the  load  of  a  channel)  on  the  path  due  to  the  maximum  traffic 


from  our  source  to  destination.  Note  that  through  the  window  control 


the  load  of  a  node  is  limited  to  w/n,  and  Eq.  (6.1)  determines  the  traf 


fic  which  generates  this  load,  hence  A*  is  the  throughput  capacity  of 


the  network.  If,  due  to  some  limitations  (which  we  discuss  shortly) 


the  throughput  (A)  is  less  than  A*  given  by  Eq.  (6.1),  then  the  load  on 


the  nodes  is  simply 


On  the  other  hand,  if  the  load  of  the  nodes  is  known  (say  Z  <  £*) ,  then 


the  throughput  and  delay  of  the  network  will  be 


In  the  sequel  we  will  refer  to  £*  =  w/n^  as  the  maximum  load 
factor;  to  A*  as  the  traffic  handling  capacity;  and  to  T*  as  the  maximum 
delay  of  a  network.  (When  there  is  no  ambiguity  we  refer  to  A*  and  T* 
simply  by  throughput  and  delay.)  Note  that  the  load  factor  (£)  is  dif¬ 
ferent  from  the  utilization  factor  p  (=  A/pC,  where  A  is  the  traffic  rate 
and  pC  is  the  maximum  transmission  rate  of  channels)  defined  as  the  frac¬ 
tion  of  the  channel  capacity  used  for  transmission. 

Eqs .  (6.1)  and  (6.2)  show  that  a  network  (for  the  purpose  of 

studying  its  throughput  and  delay)  can  be  totally  characterized  by  three 

parameters:  y  ( =  pC) ,  the  transmission  rate  of  the  channels;  n^,  the 

number  of  hops  messages  should  go  through;  and  H*  ( =  w/n^) ,  the  maximum 

load  factor  of  nodes.  Symbolically  we  designate  a  network  by  a  triple 

[£*,  n^,  y]  (Fig.  6.1-a).  In  this  figure  the  box  with  a  "wH  inside 

indicates  a  TC  with  window  size  w  in  front  of  a  network  with  path  length 

"n^"  and  channel  transmission  rate  y.  In  what  follows  we  will  find  a 

network  [£*,  n,  ,  y  ]  equivalent  to  a  system  which  is  composed  of  series 
0  n  0 
0 

and/or  parallel  connections  of  networks.  The  symbols  " -o— "  and  " 

will  be  used  to  indicate  series  and  parallel  connection  of  two  networks, 
respectively  (Figs.  6.1-b  and  6.1-c).  Throughout  this  section  we  assume 
that  w>0  and  y>0.  We  start  by  defining  equivalent  networks. 

Definition  6.1 

Two  networks  are  said  to  be  equivalent  if  the  traffic  handling 
capacity  and  the  maximum  delay  of  both  are  identical. 
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l6f.  nh-j,  7^]  -o-  [Sj  ,nh2«  V2) 

(b)  Series  Connection 


Figure  6.1  Short  Hend  Notation. 


6.1.1 


Series  Connection  of  Networks 


6. 1.1.1  Equal  Channel  Transmission  Rates 

Consider  two  networks  connected  to  each  other  in  series  (Fig. 
6.1-b)  which  have  the  same  channel  speed  y^  =  =  y.  According  to 

Eq.  (6.1)  the  traffic  handling  capacity  of  each  network  is 


«* 

X*  =  —  1  ■ 

l  l*  +  2 
l 


i  =  1,  2 


Depending  on  A* 


A*  ,  A*  >  A*  ,  or  A*<  A*  ,  we  consider  three  cases. 


Case  I:  A*  =  AJ 

In  this  case,  the  maximum  throughput  of  the  system  is  A*,  where 


Consider  Fig.  6.1-b  in  which  two  networks  are  connected  in 
series  (the  connecting  node  A  is  usually  referred  to  as  the  GATEWAY 
[CERF.  74]),  and  messages  from  source  S,  connected  to  network  1,  are 
transmitted  to  destination  D,  connected  to  network  2.  We  assume  that 
the  operation  of  each  of  the  two  networks  is  independent  of  the  other, 
which  means  that  when  a  message  from  network  1  is  delivered  to  network 
2  (via  the  GATEWAY) ,  the  ACK  for  this  message  is  sent  back  to  the  TC  of 
network  1;  a  copy  of  this  message  is  kept  by  the  TC  of  the  second  net¬ 
work  until  the  ACK  is  received  from  the  destination  node.  The  only 
interaction  between  the  two  networks  arises  from  their  throughput  han¬ 
dling  limitations.  We  start  with  the  special  case  in  which  the  channel 
transmission  rates  of  both  networks  are  equal. 


Ae  =  X1  =  +  2  Y 


X2  =  £*  +  2  Y 


(6.6) 


i.e.,  the  maximum  load  factor  of  the  two  networks  is  identical.  The 


maximum  total  delay  is  the  sum  of  the  delays  in  the  two  networks 


Comparing  Eqs.  (6.6)  with  (6.1),  and  Eqs.  (6.7)  with  (6.2),  we  note  that 


' 


A 


i 


when  £*  =  £*,  then  the  two  networks  connected  in  series  are  equivalent 


to  a  network  with  the  maximum  load  factor  of 


fc*  =  =  £* 


and  the  equivalent  number  of  hops  is  the  sum  of  the  number  of  hops  of 
the  two  networks.  Thus  we  have 


[**,  n  ,  y]  -o-  [£*,  n  ,  y]  =>  [£*,  n  +n  y] 
nl  n2  nl  n2 


The  equivalent  window  size  is 


%1  +  nh2  nhj  +  nh2 

we  =  nT  W1  =  n7  W2 


1  “2 

The  throughput  handling  capacity  and  the  maximum  delay  of  the  equivalent 
network  is  given  by  Eqs.  (6.6)  and  (6.7). 
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_ 


_ ■  * 

. » ^ .. 


Case  II :  A*  >  A* 

In  this  case  network  1  is  the  bottleneck  and  its  traffic  han¬ 
dling  capacity  determines  the  system  throughput. 


A*  =  A? 
e  1 

\ 

Q* 

*1 

~  Z*  *  2  y 

Note  that  when  Yj  =  Y2»  then  A*  <  A*  is  equivalent  to  <  Z*,  i.e., 
the  network  with  the  smaller  maximum  load  factor  is  the  bottleneck.  The 
delay  of  network  2  can  be  calculated  by  considering  the  fact  that  for  a 
traffic  equal  to  Aj  ,  the  load  at  the  nodes  of  network  2  is  (notice 
that  A*  =  AJ).  From  Eq.  (6.5)  we  have 

Z*  *  2 

?2  =  ~ 2Y  "h2 

Hence  the  total  delay  becomes 


T*  =  T*  +  T„ 
e  1  2 


Z*  +  2  Z*  +  2 

=  \1  +  ~2T~  "h2 


«,*  +  2 

=  — —  ("K,  * 


Therefore,  when  A*  >  A*  we  have 


[*J. 


Y] 


[* 


* 

2’ 


Y]  => 


[£1’  nh. 


+  n. 


Y] 


The  equivalent  window  size  will  become 
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6. 1.1.2  Non-Equal  Channel  Transmission  Rates 

When  the  channel  transmission  rates  of  networks  are  not  iden¬ 
tical,  the  analysis  becomes  a  bit  more  complex  as  we  will  see  shortly. 

As  in  Section  6. 1.1.1,  we  start  with  two  networks  connected  in  series 
and  consider  three  cases:  ^2  =  ^1  ’  ^2  ^1  an<*  X2  < 

Case  I:  A*  =  A* 

When  the  traffic  handling  capacity  of  the  two  networks  is  equal 

we  have 


*i 


*2 


«,*  +  2  ’l  a*  +  2  '2 


Solving  the  above  equation  fcr  £*  we  get 


l*  =  2 


®7Y2 


•1  “  *  *y, 

Therefore,  the  maximum  load  factors  of  the  two  networks  are  related  to 
each  other  by  the  following  relationship: 


SL*y. 

1—L 


0*  -  2 _ 

1  z  £*(y.  -  y.)  +  2y . 

3  '1  3  i 

The  throughput  of  the  system  will  be 


i,j  =  1,2  and  i  t  3 


*-1 


\  *  -  \  *  _  _ 

e  "  A1  "  Jl*  +  2  '1 


l*2 

X2  =  Z*  +  2  Y2 


The  maximum  system  delay  is  the  sum  of  the  delay  in  the  two  networks: 
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T*  =  T*  +  t; 
e  1  2 


and  the  equivalent  window  size  v  ’ 1  be 


We  can  also  replace  the  indices  1  and  2  in  the  above  equations  to  get 
an  equivalent  network,  i.e.. 
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Ye  -  y2 


z*  =  z* 

e  2 


V  1  * 


(£->)*>£ 


'5(7 


li  "»2J  ”2 

Y2 


and  the  equivalent  window  size  will  be 


w  =  £*n  =  1  + - 

e  e  h  /  Y 

*  L  '5  7 


(£-*)♦*£  ** 


Case  II:  A*  >  XJ 


In  this  case  network  1  is  the  bottleneck,  hence  we  have 


X*  =  X*  =  - = _ v 

®  A1  Up  2  Y1 


(6.8) 


When  network  2  carries  a  traffic  equal  to  X*  the  load  at  its  nodes 


becomes 


.  2 

2  4J(Y2-Yj)  ♦  2y2 


(6.9) 


and  it  is  easy  to  demonstrate  that  X*  >  AJ  is  equivalent  to  Z*  >  Z^  . 
For  a  load  of  Z^  the  delay  at  network  2  becomes 

Z  +  2 

T2  =  2Y2  nh2 

Using  the  value  of  Z 2  from  Eq.  (6.9),  and  after  some  algebra,  we  get 


Z*  +  2 

T  a  — - 

2  2y 


1  t. (la.  i)* 

1 '  Yi  /  Y, 


Y,  "^2 


I 


The  total  system  delay  is  therefore 


T*  =  T*  +  T_ 
e  1  2 


4J  +  2  [  2  "h2  ' 

1  +  *  yt  nr~  "hi 

L  ^(vT-O*^  h‘J 


(6.10) 


Comparing  Eqs.  (6.8)  with  (6.1),  and  (6.10)  with  (6.2),  we  have,  if 


A*  >  A*,  then 


nh  *  Y1J  -0-  t£2’  V’  Y2^  >  t£l*  nh  ’  Yl] 

1  2  e 


where 


n.  =  1  +  - 

e  L  i*i- 


Y,  n.  h 


l).2*  V  1 

1 V  Yi  /  Yj 


and  the  equivalent  window  size  will  be 


2  2 

w  =  1  + - - -  — -  w, 

e  /  Y 2  \  Y2  nh  1 

L  n*  .  i  +  2  —  nl  J 
1 V  Y1  1  Y1 


Case  III:  A*  <  AJ 


In  this  case  network  2  is  the  bottleneck  for  the  throughput. 
The  analysis  is  similar  to  Case  II,  only  indices  1  and  2  should  be 
interchanged. 


In  general,  when  several  networks  are  connected  in  series,  the 
bottleneck  for  the  throughput  is  the  network  with  the  smallest  traffic 
handling  capacity,  and  we  have 

VHH  [t;.  nh  ,  y.]  ->  [l;,  nh  .  Ye] 
i=l  i  e 


Let  i*  be  such  that 


then 


A*  =  min  {A*} 
l<isN  1 


Ye  =  V 

ll  -  lb 


where 


n,  =  a.  4n, 
he  i*  hi* 


v  ■  £ 


■h. 

i 


1  '  Yi*  /  Yi#  j 
The  equivalent  window  size,  the  throughput  and  the  delay  will  be 


y  i  n. 


w  =  ex. *w. , 

e  l* 


A*  =  A* 

e  i* 

l*  ♦  2 

Te  =  2y..  VV, 


If  Yj  =  Y  for  all  i,  we  get  the  same  results  as  in  Section  6. 1.1.1, 


f) .  1 . 2  Parallel  Connection  of  Networks 

As  in  the  analysis  of  series  connections,  we  assume  that  the 
operation  of  each  of  the  paral lei -connected  networks  is  independent  of 
the  other.  Parallel  connection  is  similar  to  the  case  in  which  a  user  has 
a  number  of  networks  at  his  disposal  to  send  his  messages  to  the  receiver. 
Note  that  we  assume  the  networks  do  not  share  common  windows  (we  will 
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consider  this  case  in  the  following  sections).  Unless  the  maximum  load 

factors  of  the  networks  are  equal,  the  analysis  becomes  very  complicated 

and  no  simple  expression  tor  the  equivalent  network  can  be  derived; 

therefore,  in  what  follows  we  will  assume  l *  =  l*  for  all  i. 

1 

Consider  N  networks  connected  in  parallel.  When  each  network 
carries  its  maximum  traffic,  the  total  traffic  carried  by  the  system 
becomes  the  sum  of  the  traffic  handling  capacities  of  each  network, 
hence  we  have 


X* 

e 


n 

£.  ITT7  Yi 
1=1  i 


or 

K  *  E  n  «>•>» 

i 

The  system  delay  can  be  calculated  by  considering  the  fact  that  a  frac¬ 
tion  At/A*  of  the  total  traffic  A*  that  is  sent  through  network  i 
i  e  e 

experiences  a  delay  of  Tt,  hence  the  average  system  delay  becomes 


T* 

e 


N  A* 

■  E  p-  ■',* 

i=l  e 


Using  the  values  of  At  and  Tt  from  Eqs.  (6.1)  and  (6.2),  and  considering 


the  fact  that  l*  =  i*  for  all  i,  we  have 

l 


T*  = 
e 


i*  ♦  2 
N 

2  5 


(6.12) 


Lqs.  (6.11)  and  (6.12)  indicate  that  for  the  equivalent  network  the 
maximum  channel  transmission  rate  is  the  sum  of  the  transmission  rates 


of  the  networks,  and  the  equivalent  number  of  hops  is  also  the  sum  of 
the  number  of  hops  in  each  network. 

Summing  up,  we  have  the  following: 


If 


X,*  =  8*  i  =  1,  2,  ....  N 
1 


Then  N 


The  equivalent  window  size  will  be 


w 

e 


w. 

J 


for  any  j 


The  traffic  handling  capacity  and  the  delay  of  the  equivalent  network 
can  be  found  by  using  Eqs.  (6.11)  and  (6.12). 

Table  6.1  shows  a  summary  of  our  results  for  the  series  and/or 
parallel  connection  of  networks  when  w^  >  0  and  >  0.  For  the  series 
connection,  the  network  with  the  lowest  maximum  traffic  handling  capac¬ 
ity  is  the  bottleneck  of  the  system.  In  the  special  case  where  the 
channel  transmission  rates  of  all  networks  are  identical,  the  number  of 
hops  of  the  equivalent  network  becomes  the  same  as  the  sum  of  the  number 
of  hops  of  the  networks.  For  the  parallel  connection  we  have  shown  only 
the  results  for  the  special  case  where  the  maximum  load  factors  of  all 
of  the  networks  are  identical;  the  results  for  the  general  case  are  too 
complex  to  be  useful. 


6.2  Optimal  Allocation  of  Resources 

In  the  previous  section  we  were  concerned  with  the  analysis  of 
systems  composed  of  a  number  of  networks  connected  to  each  other  in 
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series  and/or  parallel,  and  we  characterized  equivalent  networks  to 
such  interconnected  nets.  In  this  section  we  look  at  some  design  prob¬ 
lems  for  parallel  networks.  Throughout  our  study  of  flow  control,  we 
have  focused  on  a  communication  path  of  a  network,  and  by  using  the 
number  of  hops  and  capacity  of  channels  along  a  path,  we  could  find  its 
traffic  handling  capacity  as  a  function  of  the  window  size  (our  flow 
control  variable).  Our  distinction  between  a  network  and  a  path  has 
been  very  loose;  for  example,  the  analysis  of  parallel  networks  (Section 
6.1.2)  can  be  applied  to  two  separate  networks  connected  in  parallel,  or 
to  two  disjoint  paths  (between  a  source  and  destination)  with  indepen¬ 
dent  window  controls  in  the  same  network.  The  design  problem  which  we 
are  going  to  study  is  particularly  useful  when  we  take  the  latter  point 
of  view  (i.e.,  two  paths  of  the  same  network).  It  is,  however,  impor¬ 
tant  to  realize  that  our  study  is  based  on  the  assumption  that  the 
paths  are  stochastically  independent  of  each  other. 

The  design  problems  which  we  address  in  this  section  are  con¬ 
cerned  with  the  optimal  allocation  of  (limited)  resources  among  a  number 
of  parallel  communication  paths.  The  resources  we  consider  are  window 
buffers  and  transmission  capacity.  The  optimization  can  be  done  with 
respect  to  throughput  (maximize  the  throughput),  the  delay  (minimize 
the  delay),  or  some  throughput -delay  combination.  In  some  special  cases 
the  optimal  allocation  with  respect  to  throughput  also  results  in  the 
minimal  delay;  however,  in  most  cases,  the  maximum  throughput  does  not 
occur  at  minimum  delay.  For  example,  the  resources  can  be  allocated  so 
that  the  delay  becomes  zero;  however,  this  allocation  also  results  in 
zero  throughput.  When  the  optimization  is  done  with  respect  to 


throughput -del ay,  a  proper  objective  function  should  be  defined  to 
incorporate  the  opposing  effects  of  throughput  and  delay  (in  Chapter  5 
we  used  such  a  function  in  our  optimal  decision  problem).  In  the  dis¬ 
cussion  below  we  first  carry  out  the  optimization  with  respect  to  maxi¬ 
mal  throughput.  Later  we  consider  the  case  when  delay  is  also  a  measure 
of  performance.  We  start  with  the  general  case  of  allocating  both 
window  buffers  and  channel  capacity,  and  then  we  consider  the  special 
cases  of  allocating  either  window  buffers  or  channel  capacity. 

6.2.1  Optimal  Allocation  of  Window  Buffers  and  Channel  Capacities 

Consider  Fig.  6.1-c  in  which  two  paths  of  lengths  n  and  n 

hl  h2 

(number  of  hops)  can  be  used  to  send  messages  from  source  S  to  dest i - 
nation  U.  Assume  that  we  have  a  pool  of  w  buffers  to  use  in  the  traffic 
controllers  of  the  two  paths,  and  a  total  channel  capacity  of  C  (or  a 
transmission  rate  of  y  =  1/pC,  1/u  being  the  average  message  length)  to 
allocate  to  the  two  paths.  The  question  is:  "what  is  the  optimal 

J 

allocation  of  these  resources  to  the  paths,  in  order  to  optimize  the 
total  traffic  handling  capacity  between  source  S  and  destination  D?" 

The  problem  can  be  formally  stated  as  follows: 

Given:  path  lengths  n,  and  n, 

hl  h2 

Optimize:  the  traffic  handling  capacity 

With  respect  to:  window  sizes  (w^  and  w^)  and 
channel  capacities  (C}  and  C2) 

Such  that: 

Wj  +  W2  =  w  (6.13) 
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The  formal  solution  to  this  set  of  nonlinear  equations  is 


and 


w.  = 


n,  +  n, 
>'i  "2 


Y,  = 


n.  +  nk 
hl  h2 


and  from  constraints  (6.13)  and  (6.14)  we  get 


(<>.  1 7 -a ) 


(6.18-a) 


and 


Y2  = 


(6.17-b) 


(6. 18-b) 


The  above  values  for  w^'s  and  Y^'s  are  not,  however,  the  optimal 
solution!  They  correspond  to  the  inflection  point  of  the  X*  surface 
specified  by  Eq.  (6.16).  In  fact,  the  optimal  solution  is  located  at 
a  boundary  point  of  the  X*  surface,  which  is  obtained  by  allocating  the 
network  resources  to  only  one  of  the  paths  (note  that  the  variables  w^ 
and  Yj  are  bounded  below  by  zero,  and  bounded  above  by  w  and  y,  respec¬ 
tively).  This  is  shown  in  the  following  theorem. 


Theorem  6. 1 

The  maximum  traffic  handling  capacity  of  two  paths  in  parallel 
is  obtained  by  allocating  all  resources  (window  buffers  and  channel 
capacity)  to  the  path  with  shorter  length. 


Proof 


Consider  two  paths  in  parallel  with  lengths  n  and  n.  ,  where 

hj  h2 

n,  <  n 
hl“  h2 

II  we  allocate  all  the  resources  to  path  1,  then  the  traffic  handling 
capacity  becomes 


w  +  2n, 


‘1 


In  order  to  establish  the  proof  we  must  show  that 


w  +  2n,  Y  -  w  2n.  Y1  +  w  -  w.  +  2nu  (Y'V  (6-19) 


‘1 


1 


‘1 


1 


for  all  0  w^  <_  w  and  0  <_  y 


We  can  rewrite  the  inequality  (6.19)  as 


w  +  2n, 


w "  wi  1 ..  >  r  wi 

w  -  w,  +  2n,  -  w.  +  2n 

1  n„ J  LI 


w  -  w 


1 


hj  w  •  "i  *  2"h2  j 


Because  n,  <  n  we  have 
hl  "  h2 


(6.20) 


w  -  \v 

- - — = -  >  0  for  all  0  <  w  <  w  (6.21) 

w  +  2nL  w  -  w,  +  2n,  —  —  1  —  v  ’ 


‘1 


1 


The  condition  0  <_  w^  <_  w  implies  that 

w, 

_ *L _  > _ l _ 

w  +  2n,  —  w.  ♦  2n, 

hl  1  hl 

Hence  the  content  of  the  brackets  on  the  left-hand  side  of  Eq.  (6.20)  is 
non-negative  and  larger  than  the  content  of  the  right-hand  side  bracket. 
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Q.E.I). 


j 

I 

I 
1 

i 

I 

Tlic  fact  that  y^  <_  y  establishes  the  proof. 

The  traffic  handling  capacity  of  two  paths  in  parallel  for 

\ 

different  resource  allocations  are  shown  in  Figs.  6.2-a  and  Q.2-b.  In 

J 

these  figures  the  quantities  are  normalized  as  follows:  The  path  lengths 
(n^  and  n^  )  and  the  window  sizes  (w^  and  w9)  are  normalized  by  w,  the 

i  2 

total  number  of  window  buffers;  the  channel  transmission  rates  (y  and  y  ) 

1  £. 

and  the  throughput  (A*)  are  normalized  by  y,  the  total  transmission  rate. 

These  normalizations  simplify  the  presentation. 

The  throughput  as  a  function  of  window  allocation  is  shown  in 

Fig.  6.2-a  (note  that  w9/w  =  1  -  w  /w)  where  we  have  plotted  the  curves 

for  several  constant  y  /y.  For  (normalized)  path  lengths  of  n  /w  =  0.1 

hl 

and  n  /w  =  0.4,  Eqs.  (6.17)  and  (6.18)  give  w  /w  =  0.2,  w^/w  =  0.8, 

2  1 

Yj/y  =  0.2  and  ~  0.8.  w^/w  =  0.2  in  Fig.  6-2. a  is  the  point  where 

all  of  the  constant  y^/y  curves  meet.  Note  that  at  this  point  the 
throughput  is  maximum  only  for  the  curve  y^/y  =  0.2.  Fig.  6.2-a  shows 
that  the  optimal  allocation  (point  A)  is  where  all  resources  are  dedi¬ 
cated  to  path  1,  which  has  a  shorter  length. 

Fig.  6.2-b  shows  the  (normalized)  traffic  handling  capacity  of 
these  paths  as  a  function  of  the  (normalized)  channel  transmission  rate 
of  path  1  for  different  values  of  w^/w  (the  normalized  window  buffer 
allocated  to  path  1).  This  figure  shows  also  that  the  maximum  through¬ 
put  is  achieved  at  point  B  by  allocating  all  the  resources  to  the 
shorter  path.  Note  that  when  window  buffers  are  allocated  according  to 
Eq.  (6.17),  then  the  throughput  becomes  insensitive  to  the  capacity 
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W|/w 

w  =  wi  +  W2 

7  =71  +72 


Figure  6.2-a.  Traffic  Handling  Capacity  vs  Window  Buffer 
Allocation  (n^/w  =  0.1,  n^/w  ■  0.4). 


/ 
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allocation  (the  line  of  w^/w  =  0.2  in  Fig.  6.2-b).  All  throughput  lines 
for  fixed-window  buffer  allocations  are  tangent  to  a  common  envelope 
which  specifies  the  upper  bound  on  the  throughput  achievable  for  a 
fixed  y1/Y- 

In  the  case  of  more  than  two  parallel  paths  we  can  inductively 
generalize  Theorem  6.1  to  get  the  following  result: 

Theorem  6.2 

The  optimal  allocation  of  resources  among  a  number  of  parallel 
paths  is  to  assign  all  resources  to  a  path  with  the  shortest  length. 

Proof : 

The  proof  is  by  induction  on  N,  the  number  of  parallel  paths. 
For  N  =  2  the  proof  has  already  been  established  in  Theorem  6.1,  and  so 
suppose  that  it  holds  for  all  numbers  of  paths  less  than  or  equal  to  N, 
where  Nj>2.  Consider  (N  +  1)  parallel -connected  paths,  ordered  in  an 
arbitrary  manner.  Assume  path  i,  l^i^N,  has  the  shortest  length  among 
the  first  N  paths,  i.e.,  n^<^n^  ,  l£j£N.  From  the  induction  hypoth- 

i  j 

esis  we  already  know  that  among  the  first  N  paths,  path  i  is  the  one 
that  should  be  allocated  all  of  the  resources.  Now  consider  paths  i  and 
(N +  1).  By  Theorem  6.1,  all  resources  should  be  allocated  to  one  of  the 
two  paths,  i  or  (N+l),  whichever  has  the  shortest  length.  Thus  it 
follows  that  for  any  number  of  parallel -connected  paths,  all  resources 
should  be  allocated  to  the  one  with  the  shortest  length. 

Q.B.D. 
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6.2.2  Optimal  Window  Buffer  Allocation 

If  the  channel  capacities  of  the  number  of  paths  operating  in 
parallel  arc  known,  then  the  problem  is  how  to  allocate  a  pool  of  wimlow 
buffers  to  these  paths  in  order  to  maximize  the  traffic  handling  capac¬ 
ity.  For  any  window  allocation  w^  and  w2,  the  throughput  is  given  by 
Eq.  (6.15).  Because  the  condition  (6.13)  should  also  be  satisfied,  the 
optimization  problem  reduces  to 


Maximize : 

vl  (w-wx) 

w.  +  2n,  ^1  +  (w-w.)  +  2n.  ^2 

1  Hi  X 


With  respect  to:  w 


Such  that :  0  <_  w^  w 

This  optimization  problem  can  be  easily  solved  and  results  in 
the  following  value  for  w^ 


1  +  2 


nK  +  nh 

hl  h2 


JVh, 


2n, 


-*  *  Jvy 


(6.22) 


If  the  value  of  w^  found  above  is  not  within  the  limit  0  <_  w^  <_  w,  then 
it  should  be  set  to  0  or  w.  It  is  easy  to  show  that  in  order  for 
0  £  w^  £  w  (i =  1,2),  we  must  have 


W 

— ~ 1  2  Yj  -  Yi  - 


(w  +  2nh  ) 


(w  +  2nh  ) ' 

_ i_ 

4nw  nw 

hi  hj 


i,j  =  1,2 


it) 


Note  that  the  optimal  allocation  depends  on  the  channel  capacity  and 
the  path  length,  and  in  general  the  path  with  the  larger  channel 
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capacity  and/or  the  smaller  length  is  allocated  a  larger  portion  of  the 
window  size.  Figs.  6.2-a  and  6.2-b  show  several  examples  of  optimal 
allocations.  In  Fig.  6.2-a,  where  we  have  plotted  the  total  throughput 
as  a  function  of  the  window  allocated  to  path  1  for  different  channel 
capacities,  the  maximum  point  of  each  curve  corresponds  to  the  optimal 
window  allocation.  Note  that  when  y^/ (Yj  +  Y2)  is  zero  or  one>  the 
optima]  allocation  is  located  at  a  boundary  point.  When 


Yi 


‘h. 

1 


Y1  +  Y2 


+  n. 


i=  1.2 


then  the  optimal  window  allocation  becomes 

n. 


w.  =  — 
1  n 


h. 

x 


hl  h2 


w 


i  =  1.2 


This  means  that  when  the  channel  capacities  are  proportional  to  path 
lengths,  then  the  best  allocation  is  also  the  one  which  assigns  the 
window  buffers  proportional  to  their  path  lengths.  In  Fig.  6.2-a,  the 
curve  for  =  ®-2  is  such  a  curve,  and  the  best  window  allo¬ 

cation  is  Wj/w  =  0.2.  In  Fig.  6.2-b  we  can  use  the  optimal  envelope 
to  find  the  best  window  allocation.  For  a  given  y  (.y  y  2)  •  tfie  !ine 
of  constant  Wj/w  which  is  tangent  to  the  envelope  at  the  point  with 
the  given  abscissa  corresponds  to  the  optimal  allocation. 


6.2.3  Optimal  Capacity  Allocation 


When  the  window  sizes  of  the  paths  are  known,  we  can  take  the 
same  approach  as  we  did  in  the  previous  sections  to  find  the  optimal 
allocation  of  a  fixed  capacity  among  a  number  of  parallel  paths.  The 


optimization  problem  becomes 


Max  Lmize : 


wi  *  Y 


With  respect  to:  y 


1 


Such  that :  y 


1 


Yi  *  w.;var~  Y2 

^  n  2 

and  y2 
+  Y2  =  Y 


(6.23) 


The  optimization  in  this  case  is  fairly  simple,  as  demonstrated  by  the 
following  theorem: 


Theorem  6.3 

If  the  window  sizes  of  two  paths  in  parallel  are  known,  then 

the  optimal  allocation  of  channel  capacity  to  the  two  paths  is  to  assign 

all  of  the  capacity  to  the  path  with  the  larger  maximum  load  factor 

(£.?  =  w^/n^  )•  If  the  maximum  load  factors  of  the  two  paths  are  identi- 
l 

cal,  then  the  maximal  throughput  is  not  sensitive  to  the  capacity  of 
each  individual  channel. 


Proof : 

We  can  find  Y2  in  terms  of  Yj »  and  using  Eq.  (6.23),  we  get 


w  + 
1 


2n. 


+  2n, 


+  2n, 


0  1  Yj  £  Y 

(6.24) 


IS  the  coefficient  of  Yj  in  Eq.  (6.24)  is  positive,  then  X* 
becomes  maximum  when  Yj  =  YJ  if  it  is  negative  then  y^  =  0  results  in 
the  optimal  X*;  and  finally,  if  the  coefficient  is  zero,  then  the 
throughput  is  independent  of  capacity  allocation.  In  order  for  Yj  =  Y 
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to  be  optimal,  the  following  condition  should  be  satisfied: 


w,  +  2n.  w.  +  2n. 

1  hj  2  h2 

The  above  inequality  can  be  reduced  to 

W1  W2 

=  — —  >  — —  =  fc* 

1  nK  nh  2 

hl  h2 

Similarly,  for  y  *  0  to  be  optimal  we  should  have 


w  w 

1  2 

£?  =  —  <  —  =«* 
1  n,  n,  2 

hl  h2 


and  if 


_^1_  _  ^2_ 

nh1  nh2 


then  any  allocation  results  in  the  same  throughput  of 


w  w 

\*  -  - i -  y  =  _ - _  v 

w.  +  2n,  '  w_  +  2n. 

1  hj  2  h2 


Q.E.D. 


We  use  Figs.  6.2-a  and  6.2-b  to  show  some  examples  of  optimal 

capacity  allocation.  In  these  figures  the  number  of  hops  of  the  two 

parallel  paths  are  such  that  n.  /(w,  +  w.)  =  0.1  and  n.  /(w.+w,,)  =  0.4. 

l  &  n  2  a  ^ 

For  these  values  of  n,  and  n.  ,  in  order  for  the  maximum  load  factors 

hl  h2 

to  be  identical  (w/n,  =  w/n,  )  we  require  * 

hl  h2 


w  w 

- -i - =0.2  and  - - -  =0.8 

W1  +  w2  wl  *  w2 
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If  Wj/(Wj  w7)  <  0.2,  then  the  best  performance  (.maximum  throughput)  is 
obtained  when  =  0  (or  y^  =  Y).  On  the  other  hand,  if  and  w,  are 
such  that  w/(Wj+wp  >  0.2  (or  >  Jt*) ,  then  y(  =  y  gives  the-' host 
performance.  Note  that  if  Wj/(Wj  +  w.,)  =  0.2  (or  8.J  =  £*)  then  a]]  of 
the  curves  for  different  values  of  Yj/iYj  +  Y2)  result  in  thd  same 
throughput . 

In  Fig.  6.2-b  the  total  traffic  handling  capacity  for  different 
window  sizes  has  been  shown  as  a  function  of  (relative)  channel  capacity 
of  path  1.  Note  that  if  £*  <  £*,  then  the  slope  of  the  throughput  line 
is  negative  and  the  maximum  throughput  is  achieved  when  y^/y  =  0;  for 
#.*  =  £*  the  throughput  line  becomes  parallel  to  the  y^/y  axis  and  the 
throughput  is  insensitive  to  capacity  allocation;  if  >  £*,  the  slope 
is  positive  and  y^  =  y  results  in  the  best  performance. 

Theorem  6.3  can  be  generalized  to  the  case  of  more  than  two 
paths  in  parallel. 

Theorem  6.4 

If  the  window  sizes  of  N  (j>  2)  paths  in  parallel  are  known,  then 

the  optimal  allocation  (with  respect  to  throughput)  of  a  total  channel 

capacity  to  the  paths  is  to  assign  all  of  the  capacity  to  the  path  with 

the  largest  maximum  load  factor  (£?  =  w^/n^  ) . 

"i 

Proof : 

We  prove  this  theorem  by  contradiction.  Assume  there  is  a 
capacity  allocation,  claimed  to  be  optimal,  which  does  not  agree  with  the 
theorem.  We  show  there  is  at  least  one  allocation  which  gives  a  better 
performance.  Consider  paths  i  and  j  where 
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w.  W  . 

I*  =  —  >  -J-  =  l* 

1  nh.  nh.  J 
1  J 


and  the  allocated  capacities  of  these  paths  are  such  that 


+  0  and  ^  0  (or  y^  t  0  and  y  t  0) . 

The  contribution  of  these  two  paths  to  the  total  throughput  is 


-  y  +  --  J -  Y 

w.  +  2n,  1  i  w.  +  2n,  j 

•  i>i  j  hj 

By  Theorem  6.3  we  know  that  if  we  assign  the  capacity  to  path  i, 

then  the  throughput  will  be  better  than  given  above,  i.e.. 


-  (Y .  +  Y  . )  > - s - Y .  +  - - y  . 

w.  +  n.  l'i  ’  w.  +  ?n,  1 1  w.  ♦  2n.  i 

i  h.  J  l  h.  J  h.  J 

l  t  J 

Therefore,  there  is  an  allocation  different  from  the  given  one,  which 

results  in  a  higher  traffic  handling  capacity.  This  contradicts  the 


claim  that  the  given  allocation  is  optimal. 


Q.E.D. 


6.2.4  Delay  Considerations 


Before  discussing  the  optimal  resource  allocation  when  delay 
is  also  a  measure  of  performance,  we  find  it  useful  to  reflect  on  the 
characteristics  of  the  (maximum)  delay  T*. 

The  maximum  delay  of  a  path  (which  occurs  when  the  throughput 
is  maximal,  X*)  when  the  window  size  is  w  and  its  channel  transmission 
rate  is  y  was  found  in  Section  4.3  to  be 
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w  +  2n. 
_ h 

2  Y 


w  >  0  and  y  >  l) 


(4.7) 


(  0  w  =  0  and  y  =  0 

f  The  discontinuity  of  T*  when  either  w  or  y  is  zero  causes  some  problems 

which  are  more  severe  (from  the  analysis  point  of  view)  when  the  channel 
capacity  (or  y)  is  decreased  to  zero.  In  a  queueing  system  (e.g.,  an 
M/M/1  system),  when  the  input  rate  is  zero,  although  there  is  no  traffic 
to  the  system,  the  average  system  time  is  equal  to  the  average  service 
time.  The  physical  interpretation  of  this  result  is  that,  when  there  is 
no  traffic,  i£  a  "customer"  arrives  in  the  system,  its  delay  is  the 
average  service  time.  Our  point  of  view  in  analysis  of  networks  is  dif¬ 
ferent  from  above,  in  the  sense  that  we  consider  the  delay  of  messages 
that  actually  enter  the  system.  When  w  or  y  is  zero,  because  there  is 
no  traffic  flowing  through  the  system,  we  define  the  delay  to  be  zero. 

In  Figs.  6.3-a  and  -b  we  sketch  the  maximal  delay  and  throughput  as  a 
function  of  channel  transmission  rate.  When  y  decreases  to  zero,  in 
order  to  get  a  throughput  of  X*  ,  we  incur  a  large  delay.  For  a  very 
small  (but  non  zero)  capacity,  the  delay  is  unbounded;  however,  when 
y  reaches  zero,  we  define  the  delay  to  be  zero.  The  same  discontinuity 
is  observed  when  the  window  size  is  decreased  to  zero  but  y  held 
fixed.  Figs.  6.3-c  and  -d  show  the  dependency  of  T*  and  X*  on  w.  Note 
that  the  discontinuity  of  T*  at  the  origin  due  to  zero  window  size  is 
not  as  large  as  the  discontinuity  in  Fig.  6.3-a  (yi  0). 

The  effect  of  the  discontinuity  of  T*  for  two  paths  in  parallel 
can  be  seen  in  Fig.  6.4-a.  In  this  figure  (which  is  the  counterpart  of 
Fig.  6.2-a)  the  (normalized)  total  delay  as  a  function  of  window  size 


2.r>l 


of  path  J  is  shown  for  different  capacity  allocations.  Similar  to  Fig. 
b.2-a  the  path  lengths,  window  size  and  channel  capacity  are  normalized. 
Note  how  the  curves  for  small  Yj/(Y]  +  become  unbounded  as 
Kj/lw  +  w2)  increases  to  1,  which  is  the  result  of  small  channel  capac¬ 
ity  (Yj)  and  a  large  window  size  (w^).  When  all  the  capacity  y  is  allo¬ 
cated  to  path  2  (Yj/y  =  0),  then  no  traffic  is  carried  by  path  1  and 
the  delay  is  solely  due  to  the  second  path.  (The  delay  for  y^/y  =  0  and 
Yj/y  =  1  are  shown  in  dashed  lines.) 

In  Fig.  6.4-b  we  have  shown  the  normalized  delay  as  a  function 
of  the  capacity  allocated  to  path  1  for  different  window  allocations. 

The  range  of  y^/y  in  this  figure  is  (0.1)2,  0.98);  for  Yj/f  =  0  or  1  the 
curves  become  discontinued  (for  example  for  the  w^/  w=  0  curve,  the 
delay  is  zero  at  y j / Y  =  1).  A1 1  of  the  delay  curves  (within  the  range 
of  this  figure)  are  tangent  to  an  envelope  which  defines  the  lower 
bound  on  the  maximum  delay  for  a  fixed  Yj/y- 

We  now  continue  our  discussion  of  the  optimal  resource  alloca¬ 
tion.  if  delay  is  the  only  measure  of  performance  and  we  want  to  allo¬ 
cate  window  buffers  and  transmission  capacity  to  minimize  the  delay, 
then  the  best  allocations  are  trivially  the  ones  which  prevent  any 
traffic  through  the  system  (e.g.  y  ^  =  y  and  w ^  =0,  or  y^  =0  and 
W|  =  w).  In  the  case  in  which  the  window  sizes  of  the  two  paths  are 
known,  we  must  allocate  all  of  the  capacity  to  only  one  of  the  paths  in 
order  to  minimize  the  delay.  This  fact  is  shown  in  Fig.  b.4-a;  depending 
on  the  value  of  "  j/(wj  +  w 7)  the  capacity  should  be  allocated  to  path  1 
or  2.  The  more  interesting  case  is  when  the  channel  capacities  of  the 
two  paths  are  known  and  we  want  to  allocate  a  total  of  w  windows  to  the 
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two  paths  such  that  the  delay  becomes  minimal.  When  /  0  and  y^  ^  0, 
then  we  have 

Theorem  6.5 

If  Yj  ?  0  and  y£  ^  0,  then  the  optimal  window  allocation  with 
respect  to  throughput  also  minimizes  the  delay. 

Proof 


Consider  two  paths  in  parallel  with  channel  transmission  rates 
Yj  t  0  and  y.,  f  0  and  window  sizes  w^  and  such  that 

Wj  +  =  w 

The  maximum  traffic  of  the  two  paths  in  parallel  is 

,  *  ,  *  .  * 

A  =  Aj  +  A2 

Corresponding  to  this  traffic  there  is  a  one-way  delay  T*.  The  system 
composed  of  the  two  paths  can  be  considered  as  a  network  with  maximum 
throughput  and  delay  A*  and  T*.  There  are  w  unacknowledged  messages  in 
the  system,  w/2  of  which  are  being  transmitted  to  the  destination,  there¬ 
fore,  by  Little's  result  [  LITT  61]  we  have 


T*  =  2pr  (6.25) 

ibis  equation  shows  that  the  particular  choice  of  w.'s  that  maximizes 
il  minimizes  the  delay  T*. 

Q.E.D. 

a.  virtue  of  this  theorem  we  can  use  the  results  of  the 
-  i.i  »  1 1  lo.  it  ion  with  respect  to  throughput  (Section  (>.2.2) 

•  !•  I  iv  in  well;  in  particular  liq.  (6.22)  gives  the 


*. 

... 
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window  allocation  which  optimizes  both  the  traffic  handling  capacity 
and  the  delay.  In  Fig.  6.4-a  the  optimal  window  allocation  for  a 

fixed  and  is  when  the  delay  curve  is  minimal  (note  that  this  does 

not  apply  to  the  case  when  either  Yj  or  Y2  is  zero).  Similar  to  Fig. 
6.2-b,  in  Fig.  6.4-b  the  delay  curve  for  the  window  allocation  which  is 
tangent  to  the  lower  bound  envelope  at  the  point  with  abscissa 
Yj/fii  +  Y2^  defines  the  optimal  window  allocation. 

So  far  our_optimal ity  criteria  have  been  either  the  maximal 
throughput  or  the  minimal  delay,  and  except  for  the  special  case  that 
we  studied  in  Section  6.2.2  (optimal  window  allocation  for  given  chan¬ 
nel  capacities),  the  solution  to  the  optimization  problem  was  different 
for  each  criteria.  In  order  to  combine  these  two  performance  measures 
in  a  single  optimization  problem,  we  need  to  define  a  proper  objective 
function  which  incorporates  these  two  criteria.  Such  functions  have 
already  been  used  in  this  dissertation.  In  Chapter  4  we  defined  the 
power  (,<jP)  as  the  ratio  of  throughput  and  delay  [GIES  76],  and  later 
in  Chapter  5  we  used  a  linear  combination  of  throughput  and  delay 
(aX-T;  a  0) .  Because  both  high  throughput  and  low  delay  characterize 
the  efficiency  of  a  network,  both  of  these  objective  functions  are  good 
candidates  for  the  objective  function. 

In  the  remainder  of  this  section  we  study  the  optimal  resource 
allocation  in  parallel  paths  where  the  optimality  criterion  is  both 
throughput  and  delay,  and  we  choose  to  use  power  as  our  objective 
function. 

Consider  two  parallel  paths  of  lengths  n^  and  n^  between  a 
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source  ami  destination.  For  a  window  and  capacity  allocation  such 
that  constraints  (6.13)  and  (6.14)  are  satisfied,  the  power  of  the 
system,  when  the  throughput  is  equal  to  the  traffic  handling  capacity, 


is 


where 


and 


&  =  L_ 

T* 


X*  =  AJ  +  A* 


1  T* 


X* 
X* 


The  optimization  problem  becomes 


(6.26) 


Maximize:  jp* 

With  respect  to:  ,  w^  ,  Yj  and  1 2 

Such  that: 


w^  +  w2  =  w 


and 


V,  *  y2  ■  Y 


This  nonlinear  optimization  problem  can  be  solved  by  an  appro¬ 
priate  mathematical  technique;  however,  instead  of  entangling  ourselves 
in  the  mathematical  formulation,  we  present  the  optimal  allocation 
through  a  graphical  presentation.  In  Fig.  6.5  we  have  shown  the  power 
of  the  two  parallel  paths  (of  lengths  n^  /w  =  .  1  and  n^  /w  =  .4,  w 

being  the  total  number  of  window  buffers  to  be  allocated)  as  a  function 
of  the  window  allocated  to  path  1  for  different  capacity  allocations. 
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2.5 
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For  0<Yj/y<  1  (or  0  <  Y2/Y  <  1)  there  is  a  value  of  w^/w  which 
optimizes  the  corresponding  curve.  We  have  already  established  the 
fact  that  for  these  values  of  y^  the  maximal  throughput  corresponds  to 
the  minimal  delay  (Theorem  6.5),  hence  the  particular  value  of  w^/w 
which  maximizes  also  maximizes  the  traffic  handling  capacity.  A*, 
and  minimizes  the  delay  T*. 

When  all  of  the  transmission  capacity  is  allocated  to  only  one 
of  the  paths,  then  only  one  path  can  carry  traffic.  We  have  already 
shown  in  Section  4.3  that  for  a  single  path  the  best  window  size  (with 
respect  to  power  ,9?*)  is  twice  the  path  length, 


w*  =  2n, 
n 


(6.27) 


Therefore,  when  only  one  of  the  paths  is  operational,  its  window  size 
should  be  chosen  so  that 

w.  =  2n. 

1  h. 


or  in  normalized  form 


if 


Y^Y  =  1 


In  Fig.  6.5  the  maximum  of  the  y^/Y  ~  1  curve  occurs  at 


w 


0.2 
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and  for 


the  maximum  occurs  at 


or 


Yj/Y  =  0 


0.8 


w 


Therefore,  when  power  is  the  optimality  criterion: 

i.  For  an  optimal  window  and  capacity  allocation  all 
the  capacity  should  be  allocated  to  one  of  the  paths  (the  shorter) 
and  the  window  size  should  be  chosen  according  to  Eq.  (6.27). 

ii.  For  an  optimal  capacity  allocation  (the  window  sizes 
are  known)  all  the  capacity  should  be  allocated  to  one  of  the  paths. 
The  decision  as  to  which  path  should  the  capacity  be  allocated  is 
based  upon  the  relative  values  of  w^'s  and  n^  's;  see  Fig.  6.5. 

iii.  For  an  optimal  window  allocation  (the  capacities  are 
known)  the  window  buffers  should  be  allocated  according  to  Eq.  (6.22). 
This  window  allocation  results  also  in  maximum  traffic  handling  capac¬ 
ity  and  minimum  delay. 


6.3  Buffer  Allocation  Schemes  in  a  Destination  Node 

Throughout  our  study  of  flow  control  we  have  concentrated  on 
a  single  source-destination  data  traffic,  and  have  used  the  window 
mechanism  as  the  main  tool  for  throttling  the  traffic  between  a  source 
and  destination.  In  Section  4.4  and  Chapter  5  we  demonstrated  how  the 
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loss  due  to  the  finiteness  of  the  destination  buffer  node  affects,  and 
in  a  way  controls,  the  traffic.  In  this  section  we  reflect  on  the 
effect  of  the  interference  among  a  number  of  data  streams  that  merge 
into  a  single  node,  and  study  how  a  further  control  can  be  imposed  on 
a  data  traffic  by  restricting  its  usage  of  tne  available  destination 
buffers. 

To  make  our  study  more  general,  we  consider  the  operation  of  a 
destination  node  in  isolation  from  the  network.  We  consider  a  node  in 
which  streams  of  messages  from  one  or  more  input  channels  are  buffered 
and  multiplexed  into  a  single  outgoing  channel.  Tiie  channels  may 
belong  to  one  network,  or  they  can  be  considered  to  be  data  streams 
from  different  networks  which  merge  together.  As  a  result  of  the  finite 
storage  size,  some  messages  may  be  lost  and  may  need  to  be  retransmitted 
from  their  source  nodes.  I>:  Chapter  4  we  found  that  retransmission  of 
messages  affects  the  useful  traffic;  therefore,  the  input  rate  of  mes¬ 
sages  on  the  channels  is  affected  by  this  loss.  To  simplify  the  analysis, 
we  assume  that  the  traffic  rates  on  the  input  channels  are  not  affected 
by  this  loss,  i.e.,  the  input  processes  are  stochastically  independent 
of  the  node. 

This  type  of  system,  generally  known  as  a  multiplexing  node, 
has  been  analyzed  by  others  in  one  form  or  another,  but  almost  all  of 
the  previous  studies  were  based  on  the  assumption  that  as  far  as  allo¬ 
cation  of  buffer  storage  is  concerned,  there  is  no  discrimination 
among  messages  from  different  input  channels  (which  we  will  call  differ¬ 
ent  classes  of  messages).  In  other  words,  a  pool  of  buffers  is  com¬ 
pletely  shared  among  different  classes  of  messages.  In  this  section 
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we  compare  some  other  storage  sharing  and/or  allocation  schemes. 

In  the  discussion  below,  we  assume  that  messages  are  trans¬ 
mitted  out  (served!  according  to  their  order  of  arrival  and  that  there 
is  no  priority  involved. 

The  simplest  scheme  is  Complete  Sharing,  (CS),  in  which  an 
arriving  message  is  accepted  if  any  buffer  storage  is  available,  and 
all  of  the  classes  of  messages  are  treated  uniformly  (Fig.  6.6-a).  The 
other  extreme  of  buffer  allocation  is  to  partition  the  storage  into 
fixed  blocks  of  buffers  and  assign  each  block  to  a  class,  which  we  will 
refer  to  as  Complete  Partitioning,  CP.  In  this  scheme  a  message  is 
rejected  if  the  storage  allocated  to  its  class  is  full  even  if  the 
entire  storage  is  not  completely  used  (Fig.  6.6-b).  Intuitively,  as 
long  as  the  throughput  or  the  probability  of  blocking  is  of  concern,  CS 
results  in  a  better  performance.  This  is  because  as  long  as  there  is 
a  buffer  available  it  can  be  used  by  any  incoming  message,  regardless 
of  its  class.  CS  results  in  a  higher  throughput  and  utilization;  how 
ever,  it  suffers  from  two  drawbacks.  The  first  is,  if  input  rates  are 
asymmetric,  the  classes  with  higher  input  rates  dominate  the  system 
and  consume  most  of  the  storage  and  although  the  total  throughput  and 
utilization  of  the  system  is  high,  the  contribution  of  the  low  input 
classes  to  this  throughput  may  be  unfairly  low.  In  other  words,  the 
high  input  classes  deny  others  the  use  of  the  system  [KAMO  76A]. 
Another  drawback  is  that,  even  with  uniform  input  rates,  if  all  of  the 
input  rates  are  scaled  up  simultaneously  by  a  factor  (say  n),  as  p 
increases  to  infinity,  in  contrast  to  intuition,  there  is  a  nonzero 
probability  that  any  one  of  the  classes  of  messages  will  be  absent  from 
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the  storage.  This  phenomena  may  not  be  desirable  in  certain  situations, 
as  we  may  require  that  in  the  case  of  infinite  input  rates  there  should 
be  a  nonzero  probability  that  one  or  more  messages  of  each  class  be 
present  in  the  system. 

The  above  considerations  lead  us  to  impose  some  restriction 
on  the  contention  for  the  space.  The  first  drawback  may  be  remedied  by 
imposing  a  limit  on  the  number  of  buffers  that  can  be  occupied  by  any 
class  simultaneously.  The  scheme  which  incorporates  this  idea  will  be 
called  Sharing  with  Maximum  Queue,  (SMXQ),  Fig.  6.6-c.  Even  with  SMXQ 
we  are  not  guaranteed  to  have  remedied  the  second  drawback,  namely,  a 
nonzero  probability  of  being  absent  from  the  system  at  a  very  high 
input  rate  of  all  classes.  In  order  to  remove  this  deficiency  we  have 
to  allocate  a  minimum  number  of  buffers  to  each  class.  This  leads  us 
to  a  scheme  which  has  been  called  Sharing  with  Minimum  Allocation, 

(SMA) .  In  this  scheme,  besides  the  minimum  allocated  buffers  to  each 
class,  there  is  a  pool  of  buffers  to  be  shared  by  different  classes  of 
messages.  If  this  sharing  is  done  without  any  limitation,  the  scheme 
will  be  simply  called  SMA  (Fig.  6.6-d).  If,  however,  we  impose  some 
limitation  on  the  maximum  number  of  messages  of  each  class  that  can  be 
present  in  the  system,  then  the  scheme  will  be  called  Sharing  with 
Minimum  Allocation  and  Maximum  Queue,  SMAMXQ  (Fig.  6.6-e). 

Statistical  multiplexors  have  been  modeled  and  analyzed  by 
others,  [CHU  72  and  the  references  therein],  but  the  problem  wc  are 
addressing  here  has  not  been  studied  before.  Rich  and  Schwartz 
(RICH  75),  and  Drukey  [DRUK  75]  made  a  preliminary  study  of  buffer 
allocation  in  a  Store- and- Forward  (S/F)  node.  A  comprehensive  study 
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of  buffer  sharing  in  S/F  nodes  was  carried  out  by  Kamoun  and  Kleinrock 
| KAMO  7bA  and  Bj.  Although  most  of  the  terminology  used  in  this  paper 
is  from  these  last  two  works,  the  present  study  is  different  from 
theirs  since  in  a  S/F  node,  each  class  of  input  stream  has  a  dedicated 
output  channel,  wheieas  in  a  statistical  multiplexor  a  single  output 
channel  is  shared  by  all  of  the  input  messages.  This  introduces  major 
differences  in  the  analysis. 

We  model  the  system  by  a  single  server  (i.e.,  one  channel) 

queueing  system  with  finite  waiting  room  of  size  B.  The  input  to  the 

system  consists  of  R  independent  Poisson  streams  with  rates  A  ,  (r  =  J , 

2,  ...,  R).  In  order  to  analyze  the  system,  we  require  that  all  of  the 

message  lengths  be  distributed  exponentially  with  the  average  length 

1/p  bits.  Assuming  the  output  channel  speed  to  be  C,  the  service  time 

is  exponentially  distributed  with  rate  pC.  We  further  assume  that  any 

arriving  messages  which  are  not  accepted  leave  the  system  without  service 

and  the  accepted  ones  are  served  on  a  first-come-first-served  basis.  In 

the  discussion  below,  a  and  b  are  the  minimum  number  of  buffers  allo- 

r  r 

cated  and  the  maximum  queue  length  of  class  r  messages,  respectively. 

The  above  allocation  schemes  are  analyzed  in  Appendix  D,  where 
"■^we  have  presented  either  explicit  expressions  or  numerical  algorithms 
to  calcdtate  different  performance  measures,  namely,  probability  of 
blocking,  system^tktpughput ,  etc. 

In  the  remainder  ^Sf^this  section  we  present  some  performance 
curves  and  compare  the  different  shafing^ schemes.  In  all  of  the  numer¬ 
ical  cases  we  study,  we  assume  R  =  2  (i.e.,  there  are  two  input  classes). 
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Figures  6.7-a  and  6.7-b  show  the  behavior  of  the  (normalized) 
throughput  and  the  probability  of  blocking  with  respect  to  the  allocated 
storage  when  the  CP  scheme  is  used.  In  these  figures  a  pool  of  10  buf¬ 
fers  is  shared  by  two  classes  of  messages.  The  horizontal  axis  repre¬ 
sents  the  number  of  buffers  allocated  to  the  first  class,  (hence 
a^  =  10- ap.  It  can  be  seen  that  there  is  an  optimal  allocation  of 
storage  which  maximizes  the  total  throughput  (or  minimizes  the  blocking 
probability).  The  optimal  partitioning  of  storage  depends  on  the  values 
of  the  input  rates.  A  good  approximation  to  this  partitioning  is  to 
divide  the  storage  according  to  relative  values  of  the  input  rates 
(i.e.,  a^  =  ( A j /  (A^  +  and  a^  =  (A^/ (X^  +  A0))B.  The  performance 

curve  is  flatter  for  larger  input  rates.  This  means  that  deviation 
from  the  optimal  partitioning  does  not  affect  the  total  throughput 
severely.  Therefore,  as  can  be  seen  from  the  curves  for  pj  and  p' 

(p^  is  the  normalized  throughput  of  class  r  messages),  one  is  able  to 
control  the  contribution  of  each  class  to  the  output  by  changing  the 
partitioning  of  the  storage  with  little  degradation  of  the  performance. 
For  example  in  Fig.  6.7-a,  a^  =  6  and  a^ =  4  results  in  an  almost  equal 
contribution  to  the  output  by  each  class,  and  the  drop  in  the  maximum 
throughput  (which  is  achieved  at  a^  =  3  and  7)  is  negligible.  Also 
we  have  shown  the  blocking  probability  and  the  throughputs  for  the 
complete  sharing  scheme  in  these  figures  (the  dashed  lines).  Note  that 
the  total  throughput  of  CS  (which  is  about  0.99)  is  larger  than  the 
maximal  throughput  of  the  CP  scheme.  If  we  desire  to  provide  each  of 
the  classes  with  a  minimum  throughput,  however,  the  CS  scheme  fails  to 
guarantee  this  level  of  service.  As  we  will  see  shortly,  in  CS  any  one 


of  the  input  streams  has  the  potential  of  monopolizing  the  system  and 
denying  others  the  use  of  the  buffer  storage. 

In  Fig.  6.8  a  pool  of  1)  =  6  buffers  is  shared  according  to  the 
SMXQ  scheme.  Notice  that  we  have  the  following: 

i.  If  bj =  =  3  then  SMXQ  is  equivalent  to  CP. 

ii.  If  bj =  b^ =  4  then  SMXQ  is  equivalent  to  SMA. 

(a1  =  a2  =  2) 

iii.  If  bj =  b2 = 5  then  SMXQ  is  equivalent  to  SMA. 

(al=  a2  =  1) 

iv.  If  bj =  b2 =  6  then  SMXQ  is  equivalent  to  CS. 

The  curves  show  the  normalized  throughput  of  each  class  when  X^  remains 

fixed  and  0.3  and  X2  varies.  It  is  seen  how  p'  degrades  when  bj  =b7=  6 
(C.S) .  In  fact,  as  X2t°9,  p^+0.  Fig.  6.8  shows  how  one  can  prevent 
this  phenomena  by  permanently  allocating  some  buffers  to  class  1  messages. 
The  asymptotic  values  of  p'  and  p'  can  be  calculated  according  to  the 
analysis  given  in  Appendix  0. 

Jig.  6.9  shows  the  probability  of  being  absent  from  the  system 
as  a  function  of  the  input  rate  to  the  system.  In  this  figure  the  two 
input  rates  are  identical  (i.e.,  X  =  X^ +  X7  =  2Xj  =  2 X2 ) .  The  horizontal 
axis  represents  the  total  input  rate  X.  Again,  a  pool  of  B=  6  buffers 
is  shared  according  to  the  SMXQ  scheme  as  in  Fig.  6.8.  As  X  increases, 
all  curves  except  the  CS  curve  eventually  decrease  to  zero. 

As  we  pointed  out  earlier,  if  the  measure  of  performance  is  the 
total  throughput  or  the  probability  of  blocking,  then  the  CS  scheme 
results  in  a  better  performance.  This  fact  is  shown  in  Fig.  6.10,  in 
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which  a  total  of  B=6  buffers  are  shared  as  in  Fig.  6.8.  In  this  figure 
the  two  input  rates  are  identical  (Aj  =  A^),  and  the  normalized  through¬ 
put  of  class  1  is  plotted  as  a  function  of  its  input  rate  A.  (Notice 
that  the  total  throughput  is  equal  to  2p'.)  This  figure  indicates  that 
the  degradation  of  the  throughput  from  its  maximum  (which  is  achieved 
by  CS)  due  to  the  constraint  on  the  usage  of  the  buffer  is  very  slight. 
Hence,  while  the  other  schemes  can  control  the  storage  utilization  (and 
therefore  control  the  flow)  of  the  data  streams,  they  do  not  cause  the 
total  throughput  to  fall  off  too  much  from  its  maximum.  For  completeness 
we  have  also  presented  the  total  average  delay  of  each  class  in  Fig.  6.11. 

We  conclude  that  although  the  CS  scheme  always  results  in  higher 
total  throughput  and  lower  blocking  probability,  in  situations  where 
other  measures  of  performance  are  important  (e.g.,  throughput  of  indi¬ 
vidual  message  classes),  one  should  consider  other  buffer  sharing  schemes 
as  well.  In  general,  the  final  decision  must  be  based  on  the  particular 
parameters  of  environment.  (The  results  presented  in  this  section  were 
originally  published  in  [KliRM  77].) 

6.4  Conclusion 

In  this  chapter  we  covered  three  issues  related  to  computer 
network  interconnection.  We  started  by  characterizing  systems  composed 
of  series  and/or  parallel  connection  of  computer  networks  and  developed 
a  network  algebra  as  an  aid  in  the  systematic  study  of  internetting. 

For  a  series  connection  of  networks  it  was  shown  that  the  throughput  is 
determined  by  a  bottleneck  net,  the  network  with  the  smallest  traffic 
handling  capacity.  In  this  case,  part  of  the  resources  of  the  other 
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networks  is  wasted.  The  analysis  of  networks  connected  in  parallel  was 
carried  out  only  for  the  special  case  in  which  the  maximum  load  factors 
of  the  connected  networks  are  identical.  We  showed  that  a  system  com¬ 
posed  of  a  parallel  connection  of  such  networks  is  identical  to  a  single 
network  with  a  path  length  and  channel  capacity  equal  to  the  sum  of  the 
path  lengths  and  the  sum  of  the  channel  capacities  of  its  components, 
and  its  maximum  load  factor  is  equal  to  the  maximum  load  factor  of  each 
compenent  network. 

The  second  subject  we  studied  was  the  problem  of  optimal  allo¬ 
cation  of  limited  resources  (window  buffers  and/or  transmission  capacity) 
among  a  number  of  parallel  communication  paths.  It  was  shown  that  the 
solution  to  this  problem  depends  on  the  optimality  criteria.  We  studied 
this  optimization  problem  with  respect  to  three  different  criteria: 
throughput,  delay  and  throughput -delay.  For  the  throughput -del ay  cri¬ 
teria  we  chose  power  (defined  as  the  ratio  of  throughput  and  delay)  as 
our  measure  of  performance.  It  was  shown  that  only  for  a  special  case 
(optimal  window  allocation  between  parallel  paths  when  the  capacities 
are  known)  are  the  solutions  the  same  with  respect  to  the  three  perfor¬ 
mance  measures. 

The  last  issue  we  covered  in  this  chapter  was  concerned  with 
buffer  allocation  in  a  destination  node  where  a  number  of  communication 
paths  or  nets  merge.  This  subject  was  carried  out  in  a  more  general 
context  of  buffer  allocation  schemes  in  a  multiplexing  node.  Different 
allocation  schemes  were  analyzed  and  their  performances  were  compared 
with  each  other.  (These  schemes  were  originally  proposed  in  [KAMO  76A 
and  76B]  for  a  store-and- forward  node.)  It  was  demonstrated  how  further 
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control  on  the  flow  of  data  can  be  achieved  through  different  allocation 
and/or  sharing  schemes.  These  controls  are  supplementary  to  the  static 
and  dynamic  flow  controls  which  we  studied  in  the  previous  chapters. 
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CHAPTER  7 


CONCLUSIONS  AND  SUGGESTIONS  FOR  FURTHER  RESEARCH 

This  research  was  motivated  by  the  need  to  develop  efficient 
techniques  to  enhance  the  traffic  handling  capability  of  computer  net¬ 
works.  This  capability  is  usually  expressed  in  terms  of  "throughput" 
and  "delay",  which  are  affected  by  both  the  network  switching  and  flow 
control  techniques.  This  dissertation  is  addressed  to  the  analytic 
modeling  and  performance  evaluation  of  these  techniques.  Except  for 
some  suggertions  regarding  the  physical  realization,  we  have  ignored 
most  of  the  implementation  problems  of  the  techniques  we  have  analyzed. 
The  analytic  models  which  we  used  are,  in  many  cases,  rather  simplistic; 
nevertheless,  they  help  in  understanding  the  implications  of  the  real 
system  behavior. 

This  dissertation  can  logically  be  divided  into  two  parts.  In 
the  first  part  (Chapters  2  and  3),  we  studied  switching  problems.  In 
the  second  part  (Chapters  4  through  6)  the  flow  control  problems  in 
computer  networks  were  discussed. 

The  main  conclusions  of  this  research  are  as  follows: 

(1)  The  hybrid  switching  technique,  virtual  cut-through,  is  a 
fine  solution  to  the  switching  problem  in  computer  networks.  The  net¬ 
work  delay  for  cut-through  switching  is  almost  always  better  than  that 
of  message  switching;  only  when  the  channel  error  rates  are  high,  does 
message  switching  result  in  a  smaller  delay. 


(2)  Our  tradeoff  studies  showed  that  the  selection  of  a  net¬ 
work  switching  technique  should  be  based  on  a  careful  study  of  the  net¬ 
work  topology  (especially  the  communication  path  lengths)  and  traffic 
characteristics  (message  lengths  and  message  arrival  rates). 

(3)  The  contention  effect  in  computer  networks  was  explicitly 
demonstrated  in  our  study  of  static  flow  control.  It  was  shown  that  in 
order  to  improve  the  throughput-delay  performance  of  a  network,  window 
size  and  timeout  should  be  carefully  selected. 

(4)  In  Chapter  5  it  was  shown  that  the  throughput-delay  per¬ 
formance  of  a  network  can  be  further  improved  by  dynamically  adjusting 
the  window  size  and  certain  other  network  parameters  to  the  stochastic 
load  fluctuations.  In  that  chapter  we  presented  a  methodology  to 
implement  a  dynamic  flow  control . 

(5)  In  Chapter  6  some  issues  in  interconnection  of  computer 
networks  were  addressed.  A  network  algebra  for  series  and/or  parallel 
connection  of  networks  was  developed.  It  was  shown  that  in  series  con¬ 
nection  of  networks,  the  bottleneck  network  determines  the  throughput 
of  the  system.  The  study  of  parallel -connected  networks  was  carried 
out  only  for  the  special  case  where  all  the  networks  have  the  same  load 
factor  (£*  =  w/n^). 

(6)  The  solution  to  the  optimal  allocation  of  limited 
resources  (window  buffers  and  channel  capacity)  among  a  number  of  paral¬ 
lel  networks  depends  on  the  optimality  criteria,  throughput,  delay,  and 
throughput-delay.  It  was  shown  that  only  in  a  special  case  was  the 
solution  with  respect  to  the  three  performance  measures  the  same.  In 
many  cases  the  optimal  allocation  of  window  buffers  and  channel  capacity 


is  to  assign  all  the  resources  to  only  one  of  the  networks. 

(7)  Our  study  of  buffer  allocation  schemes  in  a  node  where  a 
number  of  networks  meTge  together  showed  that  in  order  to  provide  a  fair 
grade  of  service  to  different  streams  of  data  which  merge  together, 
some  nontrivial  sharing  and  allocation  schemes  should  be  implemented. 

Throughout  the  dissertation  numerical  examples  were  presented 
for  certain  values  of  different  parameters.  The  models  and  methodolo¬ 
gies  are,  however,  independent  of  these  particular  values  and  may  be 
applied  to  systems  with  different  parameters. 

Extensions  for  Further  Research 

While  there  is  an  abundant  literature  on  issues  related  to  flow 
control  in  computer  networks,  the  analytic  and  quantitative  study  of 
flow  control  is  still  in  its  infancy.  In  the  analytic  models  which  we 
used  for  the  window  mechanism,  the  assumption  was  that  the  sending  user 
always  has  a  message  to  transmit;  furthermore,  in  our  analysis  there  was 
no  account  for  the  admission  delay  to  the  network.  These  assumptions 
may  be  relaxed  in  order  to  make  the  model  more  realistic.  More  impor¬ 
tant,  our  model  is  based  on  the  assumption  of  single  packet  messages  in 
which  the  problem  of  message  sequencing  was  not  considered.  Further 
research  and  a  more  refined  model  is  necessary  to  investigate  the  impor¬ 
tant  problem  of  message  sequencing. 

The  Markov  decision  process  which  was  formulated  in  Chapter  5 
appears  to  have  extended  applications  to  decision  processes  in  which 
there  is  a  lead-time  for  a  decision  to  become  effective,  for  example, 
in  inventory  systems.  The  identification  of  other  systems  for  which 
our  formulation  is  applicable  represents  a  worthwhile  investigation. 
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LIST  OF  OFTEN-USED  SYMBOLS 

Average  number  of  messages  entering  network  per  second  with 
origin  j  and  destination  k. 

Average  number  of  messages  entering  i**1  channel  per  second. 
Average  number  of  messages  entering  a  channel  in  a  balanced  net. 
Maximum  input  rate  or  maximum  throughput . (Chap.  4,6) 

Total  arrival  rate  of  messages  from  external  sources . (Chap. 2) 
Maximum  transmission  rate  of  a  channel  («pC) 

Total  capacity  of  a  link. 

Average  message  length. 

=l/pC  =  average  transmission  time  of  a  message  over  a  channel. 
Average  transmission  time  of  a  header. 
th/tQ  (Chap.  2) 

Tl/ral  (Chap.  2) 

Number  of  channels  per  link. 

A  random  variable  representing  the  number  of  hops  a  message 
goes  through. 

=  Pr[n^=n] 

Average  of  n^. 

fL 

z  transform  of  number  of  hops  a  message  goes  through  =  E[z  n] • 
Number  of  cut-throughs  a  message  experiences  (nc  <_  -  l) . 

Average  number  of  cut-throughs. 
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Efz  I  nh  =  n,  p). 

E[z"c  |  P]- 
Network  delay. 

Average  network  delay  when  message  switching  is  used. 
Average  network  delay  when  cut-through  switching  is  used. 
Maximum  delay.  (Chap.  4,  6) 

End-to-end  delay. 

Acknowledgement  delay. 

Round-trip  delay. 

Probability  of  waiting. 

Channel  error  rate. 

Probability  of  loss- 
Probability  of  retransmission- 

Eftotal  amount  of  storage  required  on  a  path  in  message 
switching] . 

E[total  amount  of  storage  required  at  a  node]. 


S(P)  Same  as  S^,  but  for  the  cut-through  system, 
c  m 

S  Same  as  Sm  ,  but  for  the  cut-through  system. 

Z..  Average  message  delay  for  messages  with  origin  j  and 

destination  k. 

s  Total  delay  in  a  node, 

w  Waiting  time  in  a  node, 

p  The  utilization  factor  of  each  channel, 

it  Load  factor. 
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APPENDIX  B 


THEOREMS  AND  PROOFS  FOR  CHAPTER  2 


I 

' 


1 


i 


> 


B.l  Multiple  Exponential  Channels 

As  we  have  considered  a  communication  network  in  which  adjacent 
nodes  are  connected  via  multiple  channel  links,  we  need  the  queueing 
results  for  a  multiple  server  system.  We  define  P  as  the  ratio  of  the 
average  input  data  rate  to  the  maximum  transmission  rate 


where  C  is  the  total  capacity  of  a  link  which  is  divided  into  Nch  chan¬ 
nels.  The  solution  for  P^,  the  probability  of  having  k  messages  at  a 
node  when  p<  1  and  there  are  N  ^  channels  (servers)  is  given  below 


[KLEI  75]. 

k.,k 
p  Nch 
k! 

k  <  N  . 

—  ch 

Pk 

i 

k  N, 

P  (Nch)  ch 

k  >  N  . 

-  ch 

Nch! 

where 

"O 

o 

II 

Pch-1 

£ 

L  k=0 

«Nchp)k 

<Nch‘»Nc1’ 

k! 

(1  -  P)Nch!  J 

(N  p)ch  (B.3) 

Pw  =  PrC-  Nch]  =  Prt"aiting]  =  PQ  (l-~P)'N'chT 

Nch  Pw 

T  =  E[time  spent  in  a  node]  =  ♦  pC~(j' .  p)  (B.4) 
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N  =  AT  =  N  .  p  + 
ch 


P 

w 


1  -  p 


P 


(B.5) 


B.  2 


Theorem  2.3  and  its  Ptoof 


The  average  network  delay  for  the  cut-through  system  ir  a  net¬ 
work  with  noisy  channels,  conditioned  on  the  path  length  (h,j  =  n),  is 
given  by 

Tc  *  (B.6) 

where 


^{Tcc(i*j)  +  T^-^CiKl-Pe) 


k+1 


♦  T^j)(j)  r  1  -  (1  -  Pe)k+1l  }p.!j)(k)  i>  j  >  o 


T<j)(i)  - 


-  Pc)k*‘  1  fJ(H 

k=U  l _ J _ 

(i-p jk+1p(j)(k) 


i  =  j  >  0 


where 


and 


E[s]  +  kt^  = 


T^(iJ)  = 


ch  w 

pC  pC(l  -  pe) 


E[s  |  w  >  0]  .  kth  .  -g!  ,  -ErTL_T  .  kth 


♦  kt. 


P(n)(k)  = 
c 


( 


j  =  0 
(B.  7) 

i  =  j 

i  >  j 

(B.8) 


(1-  P  )  P  if  0  <  k  <  n  -  1  and  n>  1 
w  w  — 


Cl  -  p  ) 
w 


if  k  =  n  -  1 


(B.9) 


where  p  is  the  effective  traffic  rate, 
e 
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Proof : 


1 


We  start  with  some  notation: 


hcc:  number  of  continuous  cut-throughs 


>(n) 


P^"'  (k)  =  Pr[iicc  =  k  |  =  n]  ;  density  function  of  ncc 

T^(i)  =  E[delay  in  the  cut-through  system  |  path  length  =  i 
and  (i-  j)  hops  have  been  made  through  (j  hops 
remain  to  go)]  1  <  j  i 

T^(i,j)  =  E [delay  when  a  message  makes  k  continuous  cuts  |  path 
cc 

length  =  i  and  (i  -  j)  hops  have  been  made  through 
(j  hops  remain  to  go)  ]  1  <  j  <  i  and  0  _<  k  <_  j  -  1 . 

The  density  function  P^1^  (k)  is  given  by  Eq.  (B.9). 

We  first  prove  Eq.  (B.8).  Let  us  consider  a  tagged  message. 
If  i>j,  this  indicates  that  after  having  been  transmitted  over  the 
first  (i  -  j)  hops,  the  message  is  now  blocked  in  an  intermediate  node. 
This  implies  that  it  should  incur  some  queueing  delay  before  it  starts 
transmission  again.  On  each  of  the  k  nodes  that  it  cuts  through,  the 
message  incurs  t^  seconds  of  delay  (only  for  assembling  its  header). 
Therefore  we  have 


(i.j)  =  E[s|w>0]  +  kth  i>j 

On  the  other  hand,  i=  j  implies  that  the  message  is  at  the 
first  node  of  a  path.  So,  its  time  of  residency  in  the  first  node  is 
not  conditioned  on  any  event  and  we  have 

T^(i,i)  =  E[s]  ♦  kth  i=j 

This  gives  us  Eq.  (B.8). 
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To  prove  Eq.  (B.7)  we  use  a  renewal  type  argument.  On  a  path 
consisting  of  i  hops,  assume  our  tagged  message  has  j  hops  to  go.  If 
the  message  makes  k  continuous  cuts  (with  probability  P^*(k))  and  is 
received  at  a  final  node  (see  Section  2.3.2  for  the  definition  of  a  final 

k 

node)  and  has  no  bits  in  error,  then  it  will  incur  Tcc(i,j)  seconds 
delay  and  will  have  (j  -  k  -  1)  hops  to  go.  Delay  for  the  remaining  trans- 
mission  is  ^  ( i ) .  On  the  other  hand,  if  the  message  has  some  bits 

with  error,  it  must  be  retransmitted  over  the  remaining  j  hops;  however, 
for  retransmission  its  delay  in  the  first  node  will  not  be  a  conditional 
delay,  and  it  is  as  if  the  message  were  at  the  first  node  of  a  j  hop 
path.  Putting  what  we  have  said  in  mathematical  form  we  have: 

T(^(i  |  n  =  k  and  no  error)  =  (i,  j)  +  k '^  ( i) 


cc 


cc 


T^(i  |  n  =  k  and  error)  =  TlK;(i,j)  +  TUJ(j) 
c  v  1  cc  cc  J  c  J 


.(k), 


,(j), 


so 


T(j)(i  In  =  k)  = 
c  1  cc 


Finally  we  have 


,_(j),.  i  ~  t.  i  ,  „  n  no  error  on) 

c  1  cc  L  k  +  1  hops  J 

„(i),.  i  ~  ,  ,  .  vn  least  one  error! 

+  T  J  (l  n  =  k  and  error)  x  Pr  .  ,  , 

c  1  cc  L  on  k  +  1  hops  J 

-  +  Tcj”k_1)  (i)]  [d  -  Pe)k+1] 


*  [Tcc){i,j)  *  Tcj)«)][‘  - 


T^j)(i)  «  ^  I  nr  =k)p'j;(k) 


(j), 


k=0 


cc 


After  some  algebra  and  collection  of  terms  we  get  Eq.  (B.7).  Oq.  (B.6) 
is  simply  an  identity. 

Q.G.D. 
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I 


B. 3  Theorem  2.4  and  its  Proof 

For  the  cut-through  system  in  a  balanced  network  with  noisy 
channels  the  useful  traffic  is  related  to  the  effective  channel  traffic 
according  to  the  following  relationships  (pg  is  the  effective  traffic 
and  n  is  the  path  length) . 


P  * 


Nt(n)  pc 


(B. 10) 


where 

(k+  1)  ♦  Nt(i-  k-  1) (1  -  Pe)k+1 

Z  (1  -  PJk+1Pc(i)(k) 

k=0  e  (B. 11) 


Nt(i)  =0  i  =  0 


Proof: 


Let 


N  (i)  =  E [number  of  times  a  message  is  trans- 
*  mitted  on  a  path  of  length  i] 


Notice  that  if  channels  are  noiseless,  then  there  is  no  retrans¬ 
mission  and  we  have  N^(i)  =  i.  Obviously  the  useful  traffic  will  be 
related  to  effective  traffic  by  Eq.  (B.10). 

To  prove  Eq.  (B.ll)  we  use  a  renewal  type  argument: 

N_(i  I  n  =  k  and  no  error)  =  k  +  1  +  N(i-k-l) 
t  1  cc  t 

N. (i  I  n  =  k  and  error)  =  k  +  1  ♦  N_(i) 
t  '  cc  t 
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so 

Nt(i  I  ncc  =  k)  =  (k  ♦  l*Nt(i-k-  l))n  -  Pe)k^ 

♦  (k*  1  +  Nt(i))[l  -  (1  -  Pe)k+1] 

Removing  the  condition  on  the  number  of  continuous  cuts  we 

have 


After  some  algebra  and  collecting  similar  terms  we  get  Eq.  (B.llj. 

For  i  =  0,  because  there  is  no  transmission  involved,  we  have 

Nt(0)  =  0 


When  P  =  0  it  can  easily  be  shown  that  N  (i)  =  i  and  p  =  p  . 
C  t  “ 


Q.E.D. 


APPENDIX  C 


SOME  RESULTS  FROM  MARKOV  DECISION  THEORY 

The  following  results  are  taken  from  Jewell  [JEWE  03],  Howard 
[IIOWA  60,  71]  and  Ross  [ROSS  70]. 

Consider  a  stochastic  process  X(t)  which  is  observed  at  time 
points  t =  0,  1,  ...  to  be  in  one  of  a  number  of  possible  states.  The 
set  of  states  S  will  be  labeled  by  non-negative  integers  0,  1,  2,  ....  N. 
Let  A  be  a  finite  set  of  actions  such  that  corresponding  to  each  a cA, 
a  transition  probability  p^(a),  and  a  reward  r„(a)  are  uniquely  speci¬ 
fied.  We  assume  that  the  rewards  are  bounded,  i.e.,  |r„(a)|<°°  for 
all  a,  i  and  j.  When  process  X(t)  is  in  state  i  at  time  t  and  action  a 
is  chosen,  then 

1.  The  next  state  of  the  process  is  chosen  according  to  the 
transition  probabilities  of  p_(a). 

2.  On  transition  from  state  i  to  j  under  action  a,  the  process 

earns  a  reward  r..(a). 

ij 

Let  a(t)  be  the  action  chosen  at  t;  then  assumption  1  is  equivalent  to 
stating  that 

Pr[X(t+l)  =  j  |  X(0),  a(0) ,  X(l),  a(l),  ....  X(t)  *  i,  a(t)  =  a]  =  p.^a) 

(C.l) 

This  shows  that  both  the  cost  and  the  transition  probabilities  are 
functions  of  only  the  last  state  and  the  subsequent  action. 
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The  actions  should  be  taken  according  to  a  rule.  We  define  a 
pol icy  f  to  be  any  rule  for  choosing  an  action  and  #*to  be  the  class  of 
all  policies.  The  action  chosen  by  a  policy  may,  for  instance,  depend 
on  the  history  of  the  process  up  to  that  point,  or  it  may  be  randomized 
in  the  sense  that  it  chooses  an  action  with  some  probability  P  ,  azA. 

For  state  i  and  action  a  chosen  at  that  state,  we  define 

A  N 

Ma>  =  L  P, ,  (a)r  Ca)  i  =  0,  1,  2.  ....  N  (C.2) 

1  j=0  13  13 

R^fa)  can  be  interpreted  as  the  reward  to  be  expected  in  the  next  tran¬ 
sition  out  of  state  i;  it  will  be  called  the  expected  immediate  reward 
for  state  i,  when  action  a  is  taken. 

Suppose  action  a(t)  is  chosen  by  policy  f  at  time  t.  This  in 
turn  specifies  the  state  transition  probabilities  and  reward  at  time  t. 
Thus,  f  determines  both  the  evolution  in  time  of  the  Markov  process  X(t) 
and  the  sequence  of  rewards  it  earns.  For  a  policy  f,  the  expected 
total  reward  that  the  process  X(t)  earns  in  the  next  n  +  1  time  units 
when  it  starts  in  state  i  is 


r  n 


V“(f)  -  Ef 


£  Rx(t)U(t))  I  x(o)  =  i 


Lt=o 


(C.3) 


where  th^  expectation  is  taken  with  respect  to  f .  We  define  the  average 
reward  per  unit  time  for  X ( t )  which  was  initially  started  in  state  i  as 


£  Vt)(a(t))  1  X(0)  =  1 


Lt=o 


(C.  4) 


where  the  limit  exists,  since  the  rewards  are  assumed  to  be  bounded. 
We  say  policy  f*  is  the  average  cost  optimal  over  all  policies  if 
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g.(f*)  =  iiuix{g.(f)}  i  cS,  f  c  (C.5) 

x  f  i 

An  important  sub-class  of  the  class  of  all  policies  is  the 

class  of  stationary  policies,  where  a  policy  is  said  to  be  stationary 

if  it  is  non -randomized  and  the  action  it  chooses  at  time  t  only  depends 

on  the  state  of  the  process  at  time  t.  Thus,  a  stationary  policy  f  is 

a  function  from  state  space  to  action  space,  i.e.,  f(*):S-*-j4.  It  easily 

follows  from  Eq.  (C.l)  that  if  a  stationary  policy  f  is  employed,  then 

the  sequence  of  states  (X(t),  t =  0,  1,  2,  ...}  forms  a  Markov  chain  with 

transition  probabilities  {p— }  *  {p^ ^  ( -f  (i) ) }  and  it  is  for  this  reason 

that  the  process  is  called  the  Markov  Decision  Process  [ROSS  70] .  In 

the  following  we  deal  only  with  stationary  policies,  and  we  refer  to 

this  class  of  policies  as  F.  Because  a  stationary  policy  uniquely 

defines  the  action  for  a  state,  say  i,  we  use  R^(f)  and  p^(f)  instead 

of  K.(f(i))  and  p  (f(i)),  respectively.  Let  P(f)  =  (R.(f)},  P(f)  = 

1  ij  i 

{pjj(f)},  and  =  {g^(f)  )  ,  then  Eq.  (C.4)  can  be  written  equivalently 

as  follows. 

*(f)  =  ii»|=-rr  £  PCfJl^cf)}  (c.6) 

n+oo  (  t=0  ) 

where  (P(f)]  ,  the  t  power  of  P(f) ,  is  the  t-step  probability  state 
transition  matrix  under  the  stationary  policy  f. 

In  [HOWA  60]  it  is  shown  that  for  a  long  time  horizon  n,  the 
expected  total  reward  of  the  process  under  stationary  policy  f,  when  it 
starts  from  state  i  is  given  asymptotically  by 

v"(f)  =  g.(f)n  ♦  v.(f)  (C. 7) 
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where  v^(f)  is  referred  to  as  the  asymptotic  intercept  of  state  i.  For 
large  n,  only  the  rate  g^f)  is  important  for  evaluation  of  v"(f). 

For  each  stationary  policy  f,  since  S  is  finite,  if  X(t)  is  an 
irreducible  Markov  chain  then  X(t)  possesses  a  unique  stationary  state 
probability  distribution  {tt^  (f ) }  such  that  [FELL  50} 

N 

H.  =  Z  niPii(f)  j  =  0.  1,  2 . N 

}  i=0  A 

n.  >  0  i  =  0,  1,  2,  ....  N  (C.8) 

and 

N 

It.  =  1 

l 

From  the  ergodic  theorem  in  the  theory  of  Markov  processes  we 
have  [CHUN  67],  [HOWA  60,71] 

N 

g(f)  =  g,(f)  =  E  n.R.(f)  (C.9) 

1  j=0  }  3 

where  g(f)  is  defined  as  the  cost  rate  or  the  expected  cost  rate  of  the 
process  X(t).  For  ergodic  chains  and  stationary  policies  Eq.  (C.5) 
simplifies  to 

g(f*)  =  max  g(f)  (C.10) 

icF 

I-q.  (C.7)  shows  that  the  reward  gained  by  the  process  is  maximal  when 
it  starts  in  any  state  if  the  actions  are  chosen  under  stationary  policy 
f*.  For  a  finite  state  space,  if  every  stationary  policy  results  in  an 
irreducible  Markov  chain,  then  such  an  optimal  policy  exists  [BLAC  62], 
[JEWE  63],  [ROSS  70],  [SOBE  73]. 


E 

i=0 
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Given  a  stationary  policy  f,  the  reward  rate  of  the  resulting 
Markov  chain  can  be  determined  by  the  following  equations  [HOWA  60,  71] 


N 

g(f)  ♦  v .(f)  =  R.(f)  +  £  Pj  ,(f)v  (f) 

j=0  J  J 


i  =  0, 


(C.ll) 

The  set  of  (N +  1)  linear  simultaneous  equations  are  not  sufficient  to 
determine  the  v's  and  g(f).  In  fact,  except  for  g(f),  only  relative 
values  cf  the  v's  can  be  determined.  Fortunately,  a  set  of  relative 
values  of  the  v's  is  sufficient  for  the  purpose  of  the  following  iter¬ 
ative  method  in  solving  for  an  optimal  policy. 


The  Policy-Iteration  Method  [HOWA  60,  71] 

Based  on  Eq.  (C.ll),  an  iterative  method  to  find  an  optimal 
stationary  policy  has  been  developed  in  [HOWA  60].  In  the  following  we 
describe  the  algorithm,  called  the  policy-iteration  method,  for  the 
special  case  that  for  all  policies  f e  F,  the  probability  state  tran¬ 
sition  matrix  P(f)  results  in  a  single  chain  Markov  process.  For  multi¬ 
chain  systems,  some  modifications  to  the  algorithm  are  necessary,  see 
[HOWA  60,  71]. 

The  basic  iteration  cycle  in  the  policy- iteration  method  is 
diagrammed  in  Fig.  C.l.  The  upper  box,  the  value-determination,  yields 
the  g  and  v's  corresponding  to  a  given  choice  of  F(f)  and  P(f).  The 
lower  box  yields  the  P(f)  and  F(f)  that  increase  the  gain  for  a  given 
set  of  v's.  We  may  enter  the  iteration  cycle  in  either  box  with  an 
arbitrary  initial  policy  or  an  arbitrary  set  of  v's.  It  is,  however, 
necessary  to  require  that  in  the  policy- improvement  routine,  if  the  deci¬ 
sion  f(i)  for  state  i  given  by  the  old  policy  yields  as  large  a  value 
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for  the  test  quantity  as  any  of  the  other  actions,  the  decision  is  left 
unchanged.  The  stopping  rule  is  as  follows: 

The  optimal  policy  is  reached  (g  is  maximized)  when  the 
policies  in  two  successive  iterations  are  identical. 

The  fact  that  the  algorithm  given  in  Fig.  C.l  terminates  in  a 
finite  number  of  steps  and  that  it  actually  results  in  an  optimal  policy 
is  established  in  [HOWA  60,  7 1 J . 


THE  VALUE-DETERMINATION  OPERATION 

Use  P(f)  and  R( f)  for  a  given  policy  f  to  solve 
N 

g(f)  ♦  v  (f)  =  R. (f)  ♦  £  p  (f)v.(f)  i  =  0,  1,  2,  ....  N 
1  1  j=0  J 

for  all  relative  values  v^(f)  and  g(f)  by  setting 

vQ  (f)  to  zero. 


POLICY- IMPROVEMENT  ROUTINE 

For  each  state  i,  find  the  alternative  action  a  that  maximizes 

N 

M«)  +  E  PH(a)v.(f) 

j=0  J  3 

using  the  relative  values  v^(f)  of  the  previous  policy. 

Then  a  becomes  the  decision  for  state  i  for  the  new  policy 
R^(a)  becomes  R^(f)  and  p  (a)  becomes  p  „  (?) . 

Set  f  to  f. 


Fig.  C.l.  The  Iteration -Cycle  [HOWA  60] 
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APPENDIX  D 


ANALYSIS  OF  BUFFER  ALLOCATION  SCHEMES 
IN  A  MULTIPLEXING  NODE 


In  this  appendix  we  analyze  the  storage  allocation  schemes 
described  in  Section  6.3  and  present  either  explicit  expressions  or 
numerical  algorithms  to  calculate  different  performance  measures. 

For  the  model  given  in  Section  6.3,  the  states  of  the  system 
can  be  characterized  by  vector  n  =  (n, ,  n„,  ....  n„)  where  n  is  the 
number  of  class  r  messages  in  the  system.  Considering  the  fact  that 
the  entire  system  is  a  birth-death  process  [KLEI  75],  then  we  are  able 
to  derive  the  steady  state  joint  probability  distribution.  This  dis¬ 
tribution  obeys  the  product  form  solution  of  the  network  of  queues 
[JACK  57],  [BASK  75]  and  [LAM  76B]  and  the  result  is 

n 

R  p  r  R 

P[n]  =  cxn!  n  ~T  H  e  Fx  »  £  nr  =  n  (D.l) 

r=l  r'  r=l 


where  p  =  A  /yC  and  x  e  (1,2, 3, 4, 5} 
r  r 

The  subscript  x  refers  to  the  scheme  we  use.  The  integers  1 
through  5  refer  to  CS,  CP,  SMXQ,  SMA  and  SMAMXQ  respectively,  and  F  is 
the  set  of  possible  states  for  scheme  x.  C^  is  a  normalization  constant 
defined  so  that  the  probabilities  P[n]  sum  to  one. 


s;1  -  e 

x  ncF 


R  p"1^ 
n!  I]  7TT 
r=l  nr‘ 


n  =  L  ni 


r=l 


(D.  2) 


Notice  that  C^  is  also  the  probability  that  the  system  is  empty. 


In  the  following  we  will  first  calculate  and  then  other 
quantities  of  interest.  In  contrast  to  the  analysis  in  [KAMO  76B[,  we 
will  see  that  except  for  the  CS  scheme,  values  of  Cx  cannot  be  found 
explicitly;  as  an  alternative,  however,  we  give  efficient  algorithms  for 
their  calculations. 

Throughout  the  remainder  of  this  appendix  we  will  use  the  fol¬ 
lowing  notation: 

ar  =  number  of  buffers  allocated  to  class  r  messages  (this  is  used 

in  CP,  SMA  and  SMAMXQ) . 

b^  =  maximum  number  of  buffers  of  the  shared  storage  which  can  be 

occupied  by  class  r  messages  at  any  time. 

B  =  total  number  of  buffers. 

B  =  total  number  of  shared  buffers. 

s 

Notice  that  we  have  the  following  relationships: 

B  =  B  in  CS 
s 

B  =  Y'  a  and  B  =0  in  CP 

4-1  T  S 

Bs  =  B  -  £  ar  in  SMA  and  SMAMXQ. 

For  each  scheme  we  will  consider  two  special  cases,  and  for  each 
case  we  will  study  the  limiting  behavior  of  the  system.  These  cases  are: 

1.  Ar  =  nA°  ,  r  =  1,  2,  ,R  and  r)  +  00 

2.  Only  one  of  the  input  rates,  A^,  grows  to  infinity. 

D. 1  Complete  Sharing  (CS) 


The  set  of  feasible  states  for  this  scheme  is 


The  easiest  way  to  analyze  this  system  is  to  recall  the  wel 1 -known 
property  of  the  Poisson  process  [FELL  50],  namely  the  outcome  of  merging 
a  number  of  Poisson  streams  is  still  a  Poisson  stream.  By  virtue  of 
this  fact,  the  model  to  be  analyzed  reduces  to  an  M/M/1  queueing  system 
with  finite  waiting  room  B,  arrival  rate  A  =  I  Ar,  and  service  rate  pC. 
The  equations  describing  the  behavior  of  the  system  are  well  known 
[KLE1  75]  and  we  report  them  here. 

Ci‘  ■  P(0)'1  ’  1  ~ B.1  <D.«> 

1  -  p 

PB  =  Pr[£nr  =  B]  =  --  '-£i  PB  (0.5) 

L  J  1  -  p 

where  p  =  A/pC  and  A  =  Z  Af 

Notice  that  the  probability  of  blocking  is  the  same  for  all  classes  of 
messages. 

The  marginal  distributions  can  easily  be  found  as  follows: 

Let 

Pr[n  =  k,  n]  =  Pr [number  of  class  r  messages  in  the  system  is  k 

and  total  number  of  messages  in  the  system  is  n] 

then 

Pr [nr  =  k,  n]  =  P[n^»  n2*  •••»  nr_j»  nr+l’  ’ "  ’ 

En;=n-k 

i/r 

n.  SO 
l 

Using  the  value  of  P[n]  from  Eq.  (0.1)  we  have 


/  R  n”4  \  n*1 

•><»)  £  -(n^r)& 

V  n  =n_t  '  i=l  1  ' 


Pr[nr  *  k,  n] 


£  n.=n-k 
xix  1 


which  can  be  written  as 


R  ni 


Pr[nr  =  k,  n]  -  P(0)  Tir^Tr  tf  £  <"  k>!  fl  nTT 


£n  =n-k 
i^r  1 


i=l  i 
i*r 


or  equivalently 


Pr(nr.k,  n]  =  PCO)(J)p‘(|;  p.) 


R  .n-k 


Notice  that  the  (R  -  1)  classes  are  lumped  into  a  single  class  with  input 
R 

rate  5^  A  .  Using  the  facts  that 
i=i,l*k  r 

R 

p  =  %  pi 


Pr [n  =  k]  =  £  Pr[n  =  k,  n] 
r  n=k  r 


we  get 


Pr[nr  ■  k]  =  P(0)  Z[(j)p^p-pr)n-kj 


The  total  average  number  of  messages  in  the  system  is 


CD.  6) 


1  -  p 


1  -  (B  +  l)pB  +  Bp^1 
,  B+l 

1  -  p 


(D.7) 
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and  by  using  Little's  result  [LITT  61],  the  average  time  spenc  in  the 
system  for  the  accepted  messages  is  found  to  be 


For  the  marginal  statistics  we  have  the  following 


n  =  E [number  of  class  r  messages  in  the  system] 


We  have  the  following  results 


where  p 


also 


Further,  we  have 


As  for  the  limiting  throughput,  wc  have 


lim  A'  =  lim  A(1  -  PB)  =  pC 

n  t®  n  t® 

A° 

lim  A'  =  lim  A'(l  -  PB)  =  —  pC  r  =  1,  2, 
n+®  r  nt®  r  A 


ID-  14) 


R  C». 15) 


lim  T  =  BpC  (D. 16) 

nfoo  r 


Eq.  (D.13)  reveals  an  interesting  fact;  namely,  even  if  the 
input  rates  are  scaled  up  to  infinity,  there  is  a  non-zero  probability 
that  one  (or  more)  of  the  classes  of  messages  are  absent  from  the  system. 
This  is  a  distinct  drawback  of  the  CS  scheme. 


Special  Case  II:  A^+® 

From  Eqs.  (D. 4)  -  (D.6)  we  can  get  the  following  results: 


!0  n  i  B 

1  n  =  B 

X 


lim  [n  =  k]  =  < 
Ait«  r  (0 


k  =  0 

,  r  t  i 

k  +  0 


(D. 17) 


(D. 18) 


(0  kM 
lim  Pr[n.  =  k]  =  J 
A.  t®  1  (  1  k  =  B 

l 


(0  r  +  i 

lim  A"  =  lim  A  (1  -  PB)  =  \ 

A .  t®  r  A .  +®  (pC  r  =  i 

l  l  ' 


(11.19) 


(D. 20) 


These  results  show  how  the  system  can  be  dominated  by  a  single  class  of 
high-rate  messages. 
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The  #et  of  possible  states  will  he 


F2  =  {n  |  °  <  nr  i  ar  .  r  *  1,  2,  ...,R}  (D.21) 

and  we  have  B  =  Y.  a  . 

In  this  case  the  computation  of  the  normalization  term  C 2  is 
not  as  simple  as  and  in  fact  no  explicit  expression  for  this  constant 
has  been  derived.  Notice  that  the  summation  in 

r  R  p"r ' 

C2  ■  £  n  ;tt  •  » -  z>r 

-eF2  r=l  r 

is  taken  over  (*^  +  n  ^)  =  (^B^)  Possible  system  states  and  a 

direct  summation  is  out  of  the  question.  Unfortunately,  because  of  the 
n!  terra  we  cannot  even  use  the  generating  function  approach  as  in 
[MOOR  72],  [WILL  74]  or  [KAMO  76B] ,  and  we  must  resort  to  numerical 
techniques.  These  techniques  have  been  studied  by  others  and  interested 
readers  can  refer  to  [BUZE  73],  [KOBA  75],  [WONG  75]  and  [REIS  76A  and 
76B] .  However,  the  problem  at  hand  is  different  from  theirs  and  we  must 
modify  their  techniques. 

We  define  the  following  set  of  states: 

A  ra 

S(n,m)  =  {n  ■  (nj,  n2,  ....  nj  |  £  *T  =  n,  0£  nr<  ar>  r  =  1,  2,  ....  in} 

r=  1 

(D. 22) 

S(n,m),  as  defined  above,  is  the  set  of  all  system  states  in  which  only 
the  first  m  classes  may  be  present  in  the  system  and  the  total  number  of 


E  n  — 

neS(n,R)  r=l  r 


We  also  define  an  auxiliary  function  u(n,m)  as  follows 


neS(n,m) 


where  a* 


a  ,  and  clearly  we  have  B  =  a*  .  C_  is  related  to 


u(n,m)  according  to  the  following  equation 


We  also  have 


In  what  follows,  we  will  present  an  algorithm  to  calculate 


iteratively  u(0,R),  u(l,R) 


u(B,R).  Following  the  approach  of 


However,  because  neS(n,m)  and  n  =  k,  then  k  <  a*  and  we  have 


i  n  #  3  I 

L  *?Jr 

u(n,m)  *  £  £  u(n,«) 

k*0  neSCn.m) 

•  n_=k 


where  |a,y  *  inf  (a,b).  Factoring  out  the  constant  terms,  we  observe 


that  for  m  >  1 


Ln'a™j  (  Pk 

u(n,m)  =  £  !  n!  ry 


Ln*amJ 


"A*  prr]l 


neS(n-k.m-l)  r=l 


a* 


u(n>">  ■  £0  [(k)piu(n-k-m-i)] 


However,  by  definition  (Eq.  (D.23)),  we  should  have  (n-k)  _<  a*  ^ 


or  k  >  (n  -  a*  , )  and  we  have  for  m  >  1 
—  m- 1 


(  e  ro^H  °-n-^ 

1  k=|0,n-a*  7|  L  -* 

/  m-1  (D. 

(,0  n  >  a* 


u(n,m) 


and  for  m  *  1 


(D. 26) 


n  >  a* 


u(n, 1)  = 


u(0,m) 


0  £  n  1  aj 


1  <  m  <  R 


(D*  27) 


(D. 28) 


Eqs.  fi) . 2o  -  D.28)  provide  us  with  an  iterative  scheme  to  calculate 
u(n,R).  One  must  first  initialize  the  first  column  and  row  of  matrix  u 


according  to  Eqs.  (D.27)  and  (D.28)  and  each  element  of  columns  2  to  k 
is  calculated  according  to  Eq.  (D.26). 

Having  calculated  the  matrix  u,  we  can  find  expressions  for 
other  statistics  of  interest.  C2  and  Pr[£nr  =  n]  have  been  already 
defined  in  Eqs.  (D.24)  and  (D.25).  For  the  total  average  number  of 


messages  in  the  system  we  have 


n  =  P(0) 


is 


(n,R)n 


(D.29) 


We  can  find  the  marginal  distribution  of  the  message  class  of 
the  last  column  of  the  matrix  u  (class  R).  From  Eq.  (D.l)  we  have 

R  p"r 

Pr[nR  =  k]  =  £  P(0)n!  JJ  JTJ 

neF2  r=l  r' 


When  the  number  of  class  R  messages  in  the  system  is  k,  the  total  number 
of  messages  in  the  system  is  at  least  k  and  at  most  (a?  .  +  k) .  Using 

K-l 

the  definition  of  S(n,R)  we  have 


k+a*  n 

R-1  ^  R  P_r 

Pr[ii  =  k]  =  £  £  P(0)n!  n  7TT 

n=k  neS(n,R)  r=l  r 


After  factoring  out  the  constant  terms  and  some  rearrangments  we  get 


k*“R-l 


Pr[nR.k]  «  PCO) 


which  can  be  reduced  to 


k  n* 

T  nlpR  .  t1  pr 

iw  k'Oi-k)1  *  n  n  [ 

neS(n,R)  KU1  KJ 1  r*l  nr! 
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k+aR-l  ( 

Pr[nR.k]  -  P(0)  (J)pJ 


R-l  p  rl 

£  (n-k)1  n,  rr 

LnrS(n-k,R«-l) 


•  II  • 

r*l  r 


Finally,  using  u(n-k,  R-l)  as  defined  in  Eq.(D.23)  we  get 

k+a*  j 

Pr[nR=k]  =  P(0)  |jJ)u(n-k,R-l)J  pk 

In  particular  the  probability  of  being  absent  from  the  system  is 


(D.  30) 


R-l 


Pr[nR=  0]  =  P(0)  £  [u(n,  R-l)  ] 

n=0 


(D. 31) 


and  the  probability  of  blocking  of  the  R**1  class  will  be 

PBR  ’  PrIl'R  =  *Rl  •  P<°>  £  f(a”  )u‘"-aR'R-» 

n=aD  L  *  R' 


CD. 32) 


From  Eq.  (D.30)  we  can  find  other  measures  of  interest  such  as  throughput 
of  the  Rttl  class,  XR;  average  number  of  class  R  messages  in  the  system, 
nR,  etc.  In  order  to  find  statistics  of  other  classes  of  messages,  we 
have  to  rearrange  the  ordering  of  classes  and  position  the  class  of 
interest  as  the  Rt^1. 


Special  Case  I:  A  =  qA  ,  r  *  1,  2,  ...,R  and  n  + " 
We  have  the  following  limiting  statistics: 


and 


lim  Pr[£ 
T)foo 


l im  Pr  [n^  = 
nfoo 


■■■■'•I! 

,.,.r 


if  n  ^  B 

if  n  =  B 

if  k  /  a 

i 

if  k  =  a 


(D. 33) 


(D. 34) 
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_ _ 


.Jj 


Comparing  Eq.  (D.34)  with  (D.13)  we  notice  that  the  CS  scheme  does  not 
display  this  property. 


For  the  limiting  throughput  we  have 


lira  =  yC  (D. 35) 

nf°° 


a 

lim  X'  =  lim  X  (1  -  PB  )  =  yC  (D.36) 

n+°°  n+°°  r  r 


which  shows  that  in  the  limit,  each  class  will  use  a  portion  of  the  out¬ 
put  channel  equal  to  the  fraction  of  the  storage  that  it  is  using. 


Special  Case  II:  X^  +  <*> 

In  contrast  to  the  CS  scheme  in  which  all  of  the  other  classes 
would  be  denied  any  service,  the  limiting  throughput  of  other  classes 
is  not  zero.  The  expressions  for  the  limiting  probabilities  of  block¬ 
ing  and  other  measures  are  in  general  very  complicated;  we  derive  these 
expressions  for  the  case  R=  2.  Consider  a  storage  of  size  B  with  two 
input  classes  with  (normalized)  rates  and  p2,  and  let  a^  and  a2  be 

the  allocated  number  of  buffers  to  these  two  classes  (a^ +  a2  =  B) .  The 
probability  of  blocking  of  each  class  can  be  found  by  using  Eq.  (D.30). 


tcy^p 

n=a.'  l'  J 


PB.  = 

l 


B 

£ 

n=0 


£’a2j 


E/n\  k  n-k 
(  kjP2Pl 


k=|0,n-aj~| 


i> j  =  1,2 
i  /  j 


Assume  that  Xj  is  kept  fixed  and  X2+<».  In  the  limit  n2  =  a2  and  we 
have 
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1 


lim  PB-  = 
^2+°° 

The  limit  of  PB^  and  X^  t »  will  be 


Therefore  we  have 


The  limiting  throughput  of  class  1  is 

\[  =  Xjd-  PBj) 

Because  the  input  rate  of  class  2  is  infinite,  we  have 


(D. 37) 


X 


2 


Therefore  we  have 
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The  above  expressions  show  that  if  one  of  the  input  rates  grows  to 
infinity  it  does  not  prevent  the  other  class  from  using  the  system. 
This  was  not  the  case  for  the  CS  scheme. 


D.3  Sharing  with  Maximum  Queue  Length  fSMXQ) 

As  we  explained  in  Section  6.3,  in  SMXQ  a  pool  of  buffers  is 
shared  by  different  classes  of  messages  in  such  a  way  that  there  is  a 
limit  on  the  number  of  messages  of  each  class  which  can  be  present  in 
the  system  simultaneously.  The  set  of  feasible  states  is 


R 

f3  =  ln  I  £  n  <  B 

r=  1 


0<n  <b,  r  =  1,  2,  . . . ,  R} 

—  r  —  r 

(D. 39) 


where  br  is  the  maximum  queue  length  for  class  r  messages.  Clearly,  if 

£b  <  B  this  scheme  reduces  to  CP,  and  if  b  >  B,  r=  1,  2, ,  R  ,  it 

r  —  r  — 

will  be  identical  to  CS. 

To  calculate  the  normalization  constant  C^  and  other  statistics 
as  we  did  for  the  CP  scheme,  we  must  resort  to  numerical  techniques. 

The  algorithm  for  this  calculation,  except  for  minor  differences,  is 
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similar  to  the  one  for  the  CP  scheme  which  we  shall  briefly  explain. 

The  sets  S(n,m)  and  the  auxiliary  function  u(n,m)  will  be 
defined  as  follows: 


A  11 

S(n,m)  s  {n  =  (n^  n2,  ....  n^)  |  5^  nr  =  n*  °lnrlbr'  r*  2>  •••.")} 


nr 

u(n,m)  =  2-»  n!  FI  7TT  »  n 

neS(n,nO  r"  nr! 


(D. 40) 


(D.41) 


where  b*  = 
m 


m 


By  using  an  approach  very  similar  to  CP,  we  have  for  m>  1 


u(n,m)  = 


mJ  r  i 

E  (k)PraU(n-k’n,-1)  n±LB’b, 

^ro.n-jB.a^J]  L 

0  n  >  iB,b 


10.0  | 

L  il 


and  for  m  =  1 


u(n,l)  =  pj  0  <  n  <  [_B»biJ 


(D.42) 


(D. 43) 


u(0,m)  =  1  m  =  1,  2,  . . . ,  R 


(D. 44) 


where  as  before,  ja,b|  =  inf(a,b)  and  [a,bj  =  sup(a,b) 

Having  calculated  the  matrix  u,  we  can  find  the  statistics  of 
interest.  In  particular 


Cl1  *  P(0)_1  =  £  u(n.R) 
n=0 


(D.45) 


/ 
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Pr 


[s  'v ' " 


=  P(0)u(n,R) 


(D. 46) 


n  =  E [number  of  messages  in  the  system]  =  P(0)  u(n,R)n 

n=l 


For  the  marginal  statistics  of  class  R  we  have 


(D.47) 


Pr[nR=k]  = 


LB>bR-»*kJr  ,  ,  „ 

£  [(k)u(n'k'“'1)]pR 


£  u(n,R) 
n=0 


0  i  k  i  LB'bR] 


(D.48) 


In  particular 


PrlnR=0]=  g 


LBkbR-ij 

£  u(n,  R-l) 
n=0 


(D.49) 


V  u(n,R) 
n=0 


The  blocking  probability  of  class  R  messages  is  the  probability  that 
either  the  entire  storage  is  full,  or  nD  is  equal  to  bD.  This  leads  to 


LB'1,bRj 

u(B,R)  +  H 
n=b„ 


(b"R)u("-bR’R-1) 


PIL 


u 

5* 


(n,R) 


In  order  to  find  marginal  statistics  of  other  classes,  we  must 
rearrange  the  ordering  of  the  classes.  Now  we  consider  two  special 
cases : 


iHMMMMNHIMMn 
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Special  Case  I:  Xy  =  nX°  ,  r=  1,  2,  ....  R  and  n  +  ® 
In  this  case  we  have 


lira  PB  x  1 
nt®  r 


r  =  1,  2,  ....  R 


lim  Pr 
n+°° 


for  0  <  k  <  B-  b*  j 
for  k  ^  B  -  b*  j 


In  particular 


(D.51) 


lim  Pr[n  =  0]  =  0  if  B  >  b 

ru®  K  R 

This  shows  that  in  order  to  guarantee  that  at  infinite  input  rate  (n  t  <») 
there  is  always  one  or  more  packets  of  a  certain  class  of  messages,  say 
r,  present  in  the  system,  we  should  have 


B  >  £  b 
i=l  1 
i^r 

This  is  one  of  the  motivations  behind  using  the  SMXQ  scheme.  For  the 

throughput  of  the  class  R  messages  we  have  for  b*  >  B 

K  . 


h(B 


■l.r)  -  (Bb‘  *)  h(B-l-bR,  R-l)  p 


lim  X' 


+<» 
o  ,  o 


h(B,R) 


where  p.  =  X:/pC  and  h(n,m)  =  u(n,m)  |  ;  and  for  B  >  b* 

1  X  _  0  K 


p.=p. 
1  1 


(D. 52) 


lim  Xr  =  M  uC 
R 


which  is  the  same  as  for  the  CP  scheme. 


(D. 53) 
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1  ] 

Special  Case  II:  Ai  +  00 

If  B,  then  intuitively 


lim  PB  =1  r  =  1,  2,  . . . ,  R 
A.+®  r 

l 

and 

(0  r  t  i 

lim  \'  =  <  (D.54) 

Ai+»  r  (  yC  r  =  i 

This  behavior  is  similar  to  the  CS  scheme.  If,  however,  b.^  <  B,  then 

!1  r  =  i 

(D.55) 

<1  r  i  i 

j. 


In  fact,  b^^  buffers  will  be  occupied  by  class  i  messages  with  proba¬ 
bility  one  and  the  B  -  b^  remaining  buffers  will  be  shared  by  the  other 
classes  (according  to  the  SMXQ  scheme).  The  expressions  for  the  limit¬ 
ing  probabilities  are  complicated  and  we  only  present  the  expressions 
for  the  case,  R  =  2. 


* 


and 
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« 


lim 

A2+00 


T 


■S,  (>,) 

A  (•', ) 


n+1 


MC 


UC 


r  =  1 


r  =  2 


(D.S7) 


Comparing  Eqs.  (D.56)  and  (D.57)  with  Eqs.  (D.37)  and  (D.38),  we  notice 
that  in  this  case  the  behavior  is  like  the  CP  scheme  when  a  pool  of  B 
buffers  is  partitioned  into  (B  -  bj)  and  b2  buffers. 


D.4  Sharing  with  Minimum  Allocation  (SMA) 

As  Eq.  (D. 51)  suggests,  in  order  to  guarantee  that  at  infinite 
input  rates  one  or  more  packets  of  a  certain  class  of  messages  will 
always  be  in  the  system,  at  least  one  buffer  storage  must  be  permanently 
allocated  to  that  class  of  messages.  This  is  the  idea  behind  the  SMA 
scheme  in  which,  out  of  a  pool  of  B  buffers,  a^  (r  =  1 ,  2,  . . . ,  R) 
buffers  are  permanently  allocated  to  class  r  messages,  and  the  remaining 
Bs  =  B  -  £  af  buffers,  similar  to  the  CS  scheme,  are  shared  by  all  of 
the  classes.  As  a  result,  the  set  of  possible  states  will  be 


P4  ■  {n  a  (nr  n2,  ....  nr)  |  f0>  nr"ar^-Bs;  °fnr-Bs  +  ar* 

r=  1,  2,  ....  R} 

where  Bg  «  B  -  £ar  and  [a,b]  *  sup  (a,b). 


(D. 58) 
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Calculation  of  the  normalization  factor  is  more  complex  than 
for  the  previous  schemes  as  we  will  see  shortly.  We  start  with  ordering 
the  classes  in  an  arbitrary  manner  and  define  the  following  sets: 

A 


A  { 

T(b,n,m)  =  |n  =  (n^  n2>  . . . ,  nj  |  £  nf  =  n  ; 


m 

T.  [0 >  n  -  a  ~|  =  b  ;  0  £  n  <_  b+a  ,  r  =  1 ,  2 , _ ,  m  l 

r=l  r  r  r  f 


where  0  .1  b  £  Bs»  0<nj«B,  l^m^R 


(D. 59) 


T(b,n,m)  is  the  set  of  states  in  which  there  are  no  messages  of  classes 
m+1,  m+2,  . ..,  R  in  the  system,  the  total  number  of  messages  in  the 
system  is  n,  and  the  number  of  messages  in  the  shared  storage  is  b. 
Instead  of  a  two-dimensional  matrix  u,  we  must  use  a  three-dimensional 
matrix 


t(b,n,m) 


»  E 

neT(b,n,m) 


m 


n 

r=l 


m 

n  =  r  n  CD, 60) 
r=l 


To  derive  a  general  expression  for  t(b,m,m)  as  we  did  for  SMXQ,  we  start 
with  conditioning  on  the  number  of  class  m  messages  in  the  system 


n 

t(b,n,m)  =  £ 
k=0 


n! 


neT(b,n,m) L 
n  =k 


m 

Factoring  out  the  constant  terms,  we  get 


I 

( 


t(b,n,m) 


E 

n£T(b,n,m) 


m-1 


n 

r=l 
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The  inner  summation  should  be  taken  over  all  states  in  which  the  number 


of  messages  in  the  system  is  n  and  the  number  of  messages  in  the  shared 


buffers  is  b,  and  when  there  are  k  messages  of  class  m  in  the  system 


The  above  equation  can  be  rearranged  as  follows 


neT(b,n-k,m-l) 


neT(b-k,n-(k+a  ) ,m-l) 


In  the  first  summation  there  are  k  <  a  class  m  messages  in  the  system 


hence  there  are  no  messages  of  class  m  in  the  shared  buffer.  In  the 


second  summation  there  are  k  +  a  class  m  messages  in  the  system,  hence 


there  are  k  ( >  0)  class  m  messages  in  the  shared  buffer.  After  some 


algebra,  and  using  the  definition  of  t(b,n,m),  we  can  derive  the  follow 


ing  expression  for  the  element  of  matrix  t 


Expressions  for  the  limiting  probabilities  when  A^  =  nA^  , 

. ,  R  and  n  + 00  are  very  complex  and  are  not  presented  here 


For  the  special  case  that  one  of  the  input  rates  grows  to  infinity  we 


Sharing  with  Minimum  Allocation  and  Maximum  Queue  (SMAMXQ) 


In  this  scheme,  as  in  the  SMA  scheme,  a  minimum  number  of  buf 


R)  are  allocated  to  each  class  and  there  is  also 


a  limit  (b 


R)  on  the  number  of  messages  of  each  class 


which  can  be  present  in  the  shared  storage.  The  set  of  feasible  states 


I 


The  analysis  of  this  scheme  is  quite  identical  to  the  SMA 

scheme.  We  start  by  defining  the  following  sets  for  any  ordering  of 

the  classes  of  messages. 

A  <  mm 

T(b,n,m)  =|n  =  (nj,  n2>  n3>  . . . ,  n&)  |  £  nr=n>  £  nr"  =  b; 

0  i  "r  <  s*  [Vj  •  r  =  1-2 . •} 

(D. 71) 

As  in  SMA,  T(b,n,m)  is  the  set  of  states  in  which  there  are  no  messages 
of  classes  m+1,  m+2,  ....  R  in  the  system,  and  there  are  n  messages  in 
the  system,  of  which  b  are  in  the  shared  buffer.  We  also  define  a 
three-dimensional  matrix  t  as  follows: 


t(b,n,m) 


»  E 

neT(b,n,m) 


m 


n 

r=l 


n  1 
r 


(D.72) 


With  an  approach  similar  to  SMA  we  can  find  the  expression  for  the 
values  of  the  matrix  t 


Ln,a»J 

/  _  E  . 


(k)P»t(b’"'k,m"1)] 


t(b,n,m)  =  (  + 


m 


in-a  ,b,b  i 
I  m’  ’  mj 


E 

k=l 


(k  Ta  )  pl  t(b-k‘n-  Ck+am),m-l) 


if  n  <  a*  +  ib,b* 


—  m 


1_  nJ 


if  n  >  a*  +  ib,b*i 
m  |_  ’  mj 


CD. 73) 
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Having  found  matrix  t,  we  can  calculate  the  performance  measures  of 


interest.  The  expressions  for  these  measures  are  exactly  similar  to 
the  ones  for  the  SMA  scheme  and  we  do  not  present  them  here. 

A  summary  of  the  results  presented  in  this  appendix  was  orig 


inally  published  in  [KERM  77] 
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